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LeA33955-US 
SEQUENCE LISTING 

<110> Bayer Aktiengesellschaft 

<120> Nucleic acids coding for enzyme activities of 
spinosyn biosynthesis 

<130> Le A 33 955 

<140> 
<141> 

<160> 55 

<170> Patentln ver. 2.1 

<210> 1 
<211> 50000 
<212> DNA 

<213> Saccharopolyspora spinosa 
<400> 1 

ggcaccaccc cggggccccc gcctcactcg gagtcgatcc gcattccggt cacgaacatg 60 
tgacccgtcg agtgcgtgat cattagctcg gtcttcgcct cgatcgccgc ctgctgcggg 120 
gtcacgccgc aggcccagaa caccggcacg tcgcccggct ccatacgcac tcgttcaccc 180 
cagtcaggtt catcaacgtt gtcgatgccc aacttctcag gggagccgat gtgcacgggg 240 
cccccgtgca cctcgggatg cgcatgcgtg atggtgacag cctccgccac ccggtccggg 300 
ggcatgggac gcattgaaac cactagcttc ccccggaact ggcctgcggg cacacactca 360 
cggttgctga ggtaaaccgg gccgttgatc ccctggtcga tgtgccggat cgggagtccg 420 
gcatcgagca gcaagtgctc gaaggagaag ctgcagccaa gcaggaacgc gaccatgtcg 480 
tcggtccaca ggtcaacgac gtcactgcgc tcctccacca gcgccccttc ccggtacacg 540 
ctgtatttgg gcaggtcggt gcggaggtca gcaccagggg cggtcagctt cggctccgga 600 
tctcccgcgt cggtgacctc gaggatcggg cacggcttgg ggttccgctg ggcgaaccgc 660 
atgaagtcga acgcgtcatc ctggggcagc atcacgaggt tggcctgcac gaatccatga 720 
cagatcccgg acgtgggcct cgcccacgca ttcgcgcggt tggcacgcca tgcctcgatg 780 
ggctccgcgt gctgcagatc aacgaactcc atcgatgact tccccttccc ggcgatgatg 840 
acgaaggctg gcaaccgagg gttgcccgct acgggtggcc gtcactggtc tgccacccag 900 
gggtcgccga gcaggctctg gtgaacgatt ccggagatca gctcggcatg cgcggcgacg 960 
aaacgcagtg ggatcccgtc gtcgccttgg aacgcgggat cgtcaacgcc gccgctgcgt 1020 
agacctgcgc gcagccccag ctcggtccac tcgcgcacca gatccccggt gactgccagg 1080 
tttgcgcgtc gagccaggat ggccaccgca ccggtgatga cgtagcagcc caaattcgac 1140 
acggatgcag gcagcagatg atccaccgtc gtggacgcga agaatccgcc ctcaacgctg 1200 
acgcctcgcg gagtcagcgc ctcggactcg gcggcaatgt tgccgaaccc gatctcgttg 1260 
ccgttgtctc ccacggccag cgtcagagca ccgttgtccg tggcggcgcg aaggtagtcg 1320 
tcggcaaaca gatcaccggc ggtgaccggc gtgccccaga tcagatgacg gccgccaaca 1380 
gagttctgcc ccagcttctc gatggcggcg acgacgccga actgcgccgc gaactcccgg 1440 
gcctcggccg cgctggagaa tcgggactcc atcaaggtca cgccatccag gcccgcagcg 1500 
tcgaccagct tctccaccgg gccgacggcc tctgggtcga tgacgatggt cacctcgcta 1560 
ccaagaccgg cgagcgtgcg cgcgagtgcc agagagccga tggggccgtc cacctccccc 1620 
tgcgggaacc ggccttcctc gaccagaccc. gtgacgatca acgccttttc accctgtcga 1680 
cgcagcagag agcgcgcgat cgcaaggctg atcggctccg accgctcacc ccgcgccagc 1740 
gtgtagaacg ggctcacgat gccggccggg ctcgtcggcg gacgcatctc gatcgtggcc 1800 
agtcgatcca ggctctcgaa ggccgggtgc gactcgggct tcatcagctc tccctcggta 1860 
tacgacatat cccaaggact tcatccgcta cggagtcccg cctcgggcgg ttgcttacgc 1920 
ggtgaggacc atataccgta ttccactgtc caaggaaggg ggacgatgga accgaatcag 1980 
ccacgcactg cgggcacgct gaacatggtg gagctgcacc atcgcccagc cgccgcagac 2040 
tccgcggtgg ggtccggcaa cggtatgctc gcacccctcg acgcacgcgt cgacaatctg 2100 
acagatcggg tcttcgaggc catccgcgac agcatcgtaa gtcaggcact cccacctggg 2160 
cgtcgtgtga gcgaggccaa gctcgcggcc gagctgaatg tcagcaagac ccccgtgcgc 2220 
gaggcgcttc tccgcttgcg gcacgtgggc ctggtcgagg ctgccccgaa ggggctgcgc 2280 
gtcatcagcg cctcggtccg cgccattcgc gacgcctacg agcaccgggc ggggctggag 2340 
agctcggcag cctggttcac cgcacatcgg gctaccagcg cccaacggga gaggctccac 2400 
gaggtcgcca cctcatccct gcgcgcggca gagtccggcg actccgccca gttccgcgcc 2460 
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gaggatcgcg tcttccacca catcatcgcc 
gcggtcgaca acgccctggt gctgaccgct 
ggtgactcca tctcctgcgc ctcggagcac 
gacgccgacc gggcagcccg gaacaacgcc 
ctctcctcac acacgcagcg cgggaacgtg 
accccctacc tcaccattcg accgaaggac 
tcaccgctgc ggaaggagag atccccgttg 
gggagataga ccccgagctg gcggtcacga 
tcgcgcggct gcgtgagcgt gtcaagaacg 
aggtgcggtt caccgcgccg taccgatccc 
acggcgatca cgcagcggac ctgagcgagt 
agtgcgacca cacgatcatc ggcccggatg 
gcaccacgtc cgaggctgag atcgggctca 
agtccgaggc gatggactac gtgtgggggg 
acatcttgcg gatcaatccg cgctatctca 
gcttcggacc ggagctcgta ccgctcgatg 
acatcgagat ctcgacggtc aagaacggcg 
tgacccacaa gcccgccagc ctcgtgagct 
gagacatcat ctccaccgga acccctggcg 
agtgccggat cgacggcatc ggtcagctga 
gagcactcac gcggtcgagc agctcgaccg 
ccgcgagacc gtcctcgacg acgcactcaa 
cggagaggtg gaccagcaga actggaacgt 
cacactccac gcacgcgcta tgaacgccaa 
caacaacgcg cttctcgccc cgcacggcaa 
tcaggcgtct cacggtgcgt gggcgttcac 
gcgccgctac ggggttcagc gcatcatcct 
gcgttgggtg actgccgaaa tcgccagcga 
cgatgcggac acggtgcggg cgatgtccga 
cctcaacgtc ctcctcgagg tcggagtgcc 
ctcggcgctc gaggtcgccg cggcggtcgc 
cgaggcctac gagggcctcg tgaccggcgg 
caccttcttc gctggcgtcc ggtcgatcgt 
cgccaaccgc atcatcgtga cggcaggcgg 
actcggctcg tgggacggtg tccgcgacga 
catctcccac gacgccggca agtacgaaaa 
gcaggaaccg ctccggctcc acaacgcact 
ggagcctgat ctcgcgatcc tgtcgatcgg 
gccacagcca cgcgagctgt ttcgcgcgga 
gacgttcaag ctgatggacc agcacgcctt 
tcccggagac atcgtcgctt tcgacatgtc 
cttcatcccg atcatcgacg acgacttcaa 
acggggctca ggacctccta cgggaggctc 
tcagttggac atgaccacca gccacagaga 
acagacgtcg accttcctcc cgcgggaggc 
atcgtggcaa tcgccgggca gtgcggctac 
ctggaacctc aggtgcgccg ggcattcgag 
tgcagcgagg cagacgtgat caccgtgaat 
gcgatgaacg cgatctaccg cgagtacttc 
accgccggtc tccgacccgg tgtcctgttc 
tgaccgggcc actgccactc ctcccaccaa 
tccgggccgc cgtcacccga aaccgcaagg 
agtcgccctc gtactcggag ccggcggcgg 
ccgggaaggc gcgcgtgtgg cgctggccga 
ggcgtccatg accggcgata acatgagcct 
cgatgagaac gtcaccgccg tcgagagcga 
caccggcggc ccgccgccca cccgtgtgtc 
cttcgaggcg atgatcctct ccgtcgtcgc 
cgagcgggga tggggccgga tcatcacgtc 
caacctcggg ttgtcgaacg ccctccgctc 
cggtgaggtg gcaagcgacg gcatcaccag 
tgcgcgcatc gcccagttgg accaggcaaa 
catcgcgagg gaatcagaga gctctattcc 
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gagttctgcg ggaacgagat cctccagcgc 2520 

gtactccgag aacgcgacac tccctcgacc 2580 

atcgcgacgg catccgccgt tctggctggc 2640 

gaccacatcc tccacgtgat gtcgcttgtc 2700 

cccccgggaa cgagttaaac caactccccc 2760 

gggctttcgt gaccgagcac ctggcgacca 2820 

tcgtgagcga ccgcggcgtc gcagcaatcc 2880 

ccatcatgga cgtgatcgca ggagaccacc 2940 

cccccgacct cgccttcaca cgcgtggacg 3000 

cccggaagat ctggggcatc ggtctgaact 3060 

cggcgcccga ccagccggcg tccttcgtga 3120 

agccgatcgt gattccggcg cagagtgagc 3180 

tcatcggcag gacttgcgag aacgtcagcg 3240 

tcacgaccat cctggaccag accgccgagg 3300 

cccgctcgaa gaacttccgg acgttcttct 3360 

aggtgataga gaagttcgaa agcttggacg 3420 

atgagttcag gaccaacacg gtcgcccaca 3480 

tccactcgaa gatgatgccg ctgtttccgg 3540 

cgctcgtcat cggcgacgga gaccgtgtgg 3600 

ccacgacggc gcgaaaggcc actacacgat 3660 

gacccgactc ctgcaatgtg ccgacgacct 3720 

gggcgtgccc ctgggtgctc gtgtgctcct 3780 

ggcccgcgga gacctcgccc tgccggtgac 3840 

cctggcgact atggaggcgt actgcgagcg 3900 

gaccaccatg tcaccccagc tcttcgccct 3960 

ggcagcgacg cccactcagg tggccgtgat 4020 

ggccaacgaa ctcgtcgatg ctcccgcgct 4080 

cagctcgttc gacttctact gcctggtcga 4140 

taccatcgcc gactccggcc acggggtcac 4200 

tggtggcagg tgtggcgtgc gagacctcga 4260 

cgcgaccgag catctccggc tggtcggcgt 4320 

catgacaccg gaggacctcc atgcactcga 4380 

gctcgagctg gcacgtcgcg acctgttcga 4440 

aagctcgtac ttcgatcgtg tcgtcgccgg 4500 

cgtggaccta gtgcttcgaa gcggctgtta 4560 

gctctcgccg ctcgctggcc gacgggccga 4620 

cactgcatgg gcttccgtgc tgtcccggcc 4680 

gaagcgcgac gcggcccatg acctgacgct 4740 

tggctcgagg cagcggctgc gggaagcgga 4800 

cgttcgcgtt gcccccgggc tcgcgatcgc 4860 

ccacccgtgc acggccttcg acaagctccc 4920 

cgtcgtcgac ggcgtcctca cgttcttctg 4980 

cggttgaacc gaccccacaa gctcccgaag 5040 

ggccccgaca ccgtgtcccg tcaggcgttc 5100 

ccttactccg ccgccgtacg catcggcaac 5160 

ctcgccgaca gatccctcgc accgggtggt 5220 

aacctcatga ccgcgctgcg ttcagccggc 5280 

gtgttcctcg ctgacgggga cgacttcgac 5340 

agcgagcccg gcccggcccg gacgaccatc 5400 

gaggtcagcg ctcaggccgt ggtaggcggc 5460 

tgtccctcca tcaacggcgt gccacgactt 5520 

acatgtcatg gatctcgcaa tcaatggaag 5580 

actcggttcg gccattgccg cctcgctaag 5640 

catcgacaag gacgcgctca cggaggtcga 5700 

ggtctgggac ttggcgaacc tgtcggtcat 5760 

gctcggcccg gtggagatcc tggtcaacat 5820 

cgggcaggac gccaccctgt ggcgtcggca 5880 

gatcacagac cgcgtgctcc cgggcatgac 5940 

gacttcgtcc ggcgtcatcg cgcccatccc 6000 

gagtctggtg ggctggtcca agacactgtc 6060 

caacatcgtc gttccggggc gcatcgccac 6120 

ggccgcacgg gagaaccgca ctgcggagaa 6180 

ggtcggccgc tacgggcggc gcgaggaatt 6240 
Page 2 



cgccgacgcg gtcacatttc tcgccagcga 
ccgtgtcgat ggcggactgc tccagaacgt 
cagcgcaacc cgaagccatg tccgctcctc 
aaactcatca cctacgtgcg ctgatctacg 
atcacgatca gcagcacgga cctcgcctcc 
cggttttcgg gtcgccggcg acccagcttt 
tgctgcctgg aaggcgtact tcaggtatgg 
atcgtatctc tggaaagagg tcgtccggtg 
gaaacagcta cgccgacgct acggcatcca 
tccggaaggc cggctcggga gacgtgcagc 
gacgacctgg cactaccagg acagctcgtg 
cctccgatct tgctaacagt caagcggcga 
cacccgcgct ggcctcacgt ctgtgcgcca 
gaaaggaacc catcgaatca tgacgaaccg 
cgacatcgcc cgcgcatttg taggcccagc 
cagcgtcgtc agcggcgacc tggcccgagc 
gggccatgac gaactcaccg tggcgcttag 
gagtatcaac tcggcacatc acgatcaagc 
cctgtgcgaa aaaccgcttg ccctgaccgt 
ggagaaggca ggcgtagtgt tcgcgacaaa 
tttgatccgg gatgcaatcg ccgccggcga 
gaacgcgatc cgactgccgc ccgccgcgcg 
cgccggagtc gcactcgacc tcacggtgca 
tgccgatccg ctaacggtcg tggcccatac 
cgatgagaca atctccggaa gcgcggagtt 
aagcttcgtg accggacatg cccccacatg 
catcggcacg ggtatccagt cgatgtcacc 
cgggtccagg aagatcgatc tgggtacccg 
gttcatcgag gctgtgcgtg gcgaggccga 
gtcgttggcc ttcgcacggg cggcgctgga 
gggacaggct cgatgacgaa agctgggcgc 
aaccgggcat cggcagacgt ggcgatcgtc 
ggggcgctcg ctggtcgcgg actcgatgtg 
caagagcccg ctatcgacgt gtgcaaggct 
ggtggctcga tggtgctggg agaccgttca 
cagccaaggc ctacgaagct ggcccgtggc 
gtcctatgtg ggcggccgca actacatgca 
caatggccag cctccgatca tggaccatac 
cggacgatta gctgaagcgg atgtccgctt 
gtgttgccgg gccgcgaacg gatcatgtga 
acagagcagg tgactacgtt atctctgccc 
ggtaaagccc agcttggaaa ggatgttctc 
gaccaggtca gccgcaatct ctttgtttgt 
gcgttctcgt tttgtgagaa ccggttcggt 
cggcggtggc ttctccccaa gagcatagaa 
ctggcctcgg gcggtttcgt attcggtgtc 
ctgctcacgc caggtcagta gcgtctcgga 
cattttatcc aacgccccga aaaggatcgc 
gacccaggcc agcagttcaa cgcagtcgct 
attgcggagg gacctggacg cggaggcggc 
agtgatcgcc atgttccact ccgcccacga 
gatcttgcgg cattcgtgca gaaggaccat 
tgcctgcgct cgtattgaga aaatggctaa 
gtggcgggcc agtgcctcgt cgagcagttg 
ctcccggtcg ccaagaaact gagtcgcgta 
ggccagctcg cggctttcgt taagcaggac 
ggcaatccat ccggtgatcc acagtgcgcg 
ggccaacgcc ctgtccagcc acatgcggcc 
gaaccacagc gcgctcacca tccgcaaccc 
gtagtccaac gctgcccaga agttgccgcg 
ttgatgcggg cccgcggact cggcatcgga 
gtgccggcag cgcagagcgt tttcggcacc 
acggatagtc tccagcatcc ggtatctcgt 
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gcgtgccagc tacatcaccg gatccgttgt 6300 
gtaaccccga caaacatcct gtctccagat 6360 
ggggtcactg acgtcagcga tcccgagccc 6420 
gtcatccagg gctacccact cgccagcggg 6480 
agggtgcacg ctgcgcgcga gcggtgccgt 6540 
tccccacatt gaaagctatc cggaaccgga 6600 
atgcccgaac gcgaccttca gctacctacg 6660 
gcaggagcgc aaacaccggc gtactccctg 6720 
gccagccgac ggcgatgtcg ggctgttcga 6780 
cgggtgggga aggagcgctg acaaaatcct 6840 
ggcaagcgag atcaggcggg gtatgcgaaa 6900 
cccgtttcat ggtcgtgtaa tcctcgcggc 6960 
gcggaaatgt cgcacaggcg cggtattccc 7020 
tcccttccgc tggtcgctcg tgggcgccag 7080 
gatctccgcc cagcctgacg ccgcggtggt 7140 
gcaggatgta gcggcaccat tcggcgcccg 7200 
cgatcctgac gtcgacgccg tgtacatctc 7260 
acttgcggcg atcgcagccg gcaagcacgt 7320 
cgccgaggca caagaacttg tcgatgccgc 7380 
ccaccacatg cgcaactccg tgccacatcg 7440 
catcggcgag ccggtcgcag tgaacgtccg 7500 
gagatggcgg acgaccgatt cggcagcggg 7560 
cgacgcggac tgtctccggt tcgtactcaa 7620 
ctcgtccggc cacatgacga gggtgggaat 7680 
cgtcagcggt gtgcatgctt cattcgtgga 7740 
cctggaggtt ttcggaaccg aaggagcgct 7800 
ggtcggcacg ctgacacacg tcggtccggc 7860 
cgaggacctc tacgtcgtgg gtgttcgacg 7920 
gcccccggcc tccggccaag acggggtctg 7980 
ggctgctcgg accgggcatc ggcagactgt 8040 
agttaggcac aacagatcat gcacaaggac 8100 
gggtccggtg gaggctgact gatcgtcgcc 8160 
ctcgtgttgg aacgggacga gtttttggct 8220 
tcgcgcacgg gcactacaag ccggacgaac 8280 
caccggctac tcactactcg tcggcggcaa 8340 
tcaatctata ccgcgtaagc tcgccaacgg 8400 
gcacccgaac tccgcggtca tggccatcgg 8460 
ctacttcaac atgcgccgtt gcgggggccg 8520 
tcggtatcgc agaccttgcc acgacgggcg 8580 
gccgggctgg cccgagaagg gcatgggcca 8640 
ggtcatccaa ctgatgacct gtgcgcgcga 8700 
gacgtgggcc tcggctgtgc gctgggaaat 8760 
tttgccagcg gtgaccaggg cagcgacctc 8820 
ttccggagcc gaagcggcgg ccgcttcgac 8880 
gatagtctgc tcccggttca tccgcgtccc 8940 
tgtgagtgct tcctgggcgc gtgccttggc 9000 
tccgaacagc ggggtgccaa tcagttccca 9060 
cgcgcgttca ggattgcgtt ctgcggcggc 9120 
gatcccgagt aggtcgttga gttcccgctt 9180 
tgccttggtt gggtcgcccg cggcccacca 9240 
tagcctccag cgctcgccaa gttctgtaca 9300 
cgcccggtca aggtctccaa gcaggcccgc 9360 
ggcggggccg gtccagtggc cagttgcgcg 9420 
cgtggcgcga gtcaggttgt tgtcccacat 9480 
ggtaaattcc gtttcgtcgc ccagtttccg 9540 
cagcgcccca tctcggtccc cttgccggtg 9600 
ggcacgctgg ctggtgggtt cggtctccag 9660 
ctccctcaga tagtcgccag cagcccagta 9720 
aaccgacgcc tcgccaggtg tggtgaagca 9780 
ctcggcgcgc agacgcgcga gccaaccggc 9840 
ctgctcggcg agatgcaggt agtagtcgcg 9900 
cgcctcggtg agccggtcgt gcccgtattg 9960 
cttcgtgcgc tcctcggtcc gggtgagcac 10020 
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ggacttgtcg atcagtcccg cgagagcggc 
ggtgcagacg ttttcggctg cgtcgagatc 
ccacaactgc ttctcatgct cattgcacag 
ggtctgatgc cgaggatcgt cggcgcggtc 
cattcgcgtt agaatttgcc cgactgagag 
cgctagggga atgccgtcca gcctccggca 
gttgatggtg aaccctggta gcgccgccgc 
acgctggtag ggacgacctt cctgttgtga 
cagtggcgga accggccagg tctgttccgc 
aagtatccgc aaccgcgggg ccgcggccag 
cagcaggtgc tcgcagttgt ccaacaccag 
gagcacggcc tccggggcac gtgtcgaacg 
ctgcggcacc agcatgggat cgcgaacgtt 
gaaggcccgc cgcacgtcgt gtgcgacgtg 
gaaaccagtc agggtcacta accgtgaact 
ctgacggcgc cccacaaagc tcgtcacctc 
aaccccagac acgactcacc gcacctcggt 
ggctggcagc gcacttacag cttgaacaac 
gttcgttacg atctcggccg atgttgatcg 
gctggcctgc ccagcctgta ccactgatca 
ttggcactgg tcaggccact gctggcggtt 
ggcggcatcc gctccgctac tggtgcgcga 
tgagctcgtc gtggaggcag cggaccgggt 
gatcaatcgc tcacgtggcg agatctcgca 
caccttcatc ctgttccccg aggtgctgac 
tgtggtgatt gttcaagccg agccggatgc 
cgacctagtg atcgccgagt cctacccgca 
gacagagatc ctgatgaccg accgcatgca 
gcccgtcggc tgggggcagg cgctttcgcc 
ccaacgtctt gcctgctcga agttgtcccg 
gatcttgacg ctcatgcatg gaggttcacc 
cggcggtcgt tgcgccagcg gcgatcatcc 
agaattggct caggccggga gtttggacga 
cacacgccat gaccagtggt gtccgtccgc 
cgaccagcgc cacgagcgcg accgtgttgg 
ggatccacgc acaaatggcg gccccggcgg 
tcagcgcagc gcccagcgag gcttcggggc 
tcgggggtgg tcgcaccgcg atcgccatcg 
gcagcaactg gcgggcgttg aggctgcccc 
agccgatctc gccgaggttg ccgccgagca 
acatggcgcg gccttccaca atggcgtcga 
cgaggtcggc ggcctcgcgg gcggcggggg 
cggccaaccg gatcgccggg gcgtcgttgg 
cgttgcgttg cagggcgtcg acgatgcgcg 
ccgcgacggt ggtcacttcc gccgccagtt 
cggtcatgac gcgacgctcg ttgagcgcgt 
tggggtggtc gccggtgacc atcacgacct 
cggcggccgc agtgggccgc accgggtcgg 
cccggatccg ggactcgtcg agatcactgc 
gcacccggta gccctgccgg gcgaggtttt 
tctcctccaa cggcacgtcg ccatctgatc 
tctccggggc acctttcaca ctcagcagat 
ggtagccacg ggaaggctcg aagggcatct 
cggccgggtg cactcccgcg ccacgcgcgc 
ggagcagttc tccgttctcc accaacggac 
gttccactgt ggagatttcc tcgagccacc 
gcgcgatgcg cccttcggtg agggtgccgg 
ccaaagcctc gatcgtcgcg gagttacgca 
cggccgcgag ttcggcgacg gtggcaacga 
ccaggctcac cgcccgcccc agcgatgtcc 
cgagcagaat cccacccgct cccacgcaga 
ggcgagccgc gacaccatcg gcacctcgcc 
cggtgttcgc gccggtcgcg accacgacac 
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gaagacgtca tcatcggtca gcccgtcgcc 10080 
gaattcgtcg gtgaaaaccg aacagcgcgc 10140 
gttgtagctc cagtcgacgg cggcgcgcag 10200 
gccggtggtg agcagccgga atcggttctc 10260 
cacgcgaacg cgcacggcgg cgagttcgat 10320 
taacctggcc acgatctgtt cgttttcttc 10380 
ggcacgctgc tcgaacaagg tcaacgcttc 10440 
cgttgatgaa tcgctgccag caggcacgga 10500 
cactatccct aaatgttctc ggctggttgc 10560 
cagcacagcg actaggcggc tgcaggcccc 10620 
caagacttgc ttgtccgcca agtaatcgac 10680 
gtcatggatc ccaagagcag ccgcgaccgc 10740 
agccagctcc accagaaatg tcccggcgga 10800 
cagcgccagc cgggacttcc cgatcccgcc 10860 
ggacagcgcg cgcttaactc ttgtggccgc 10920 
atatgggagg cgaccagcct cccgaaccgc 10980 
agaactagcg ttttattgta gcccggtcat 11040 
tgttcgaacc acttcgaacc caccgacggg 11100 
tggtgggtgc ggggagggcg tgcgttgccc 11160 
aggtcgccga ccgggcccgt gtgggcgatg 11220 
atgcggcggg ccagctgcgg cggcacggtg 11280 
aggccgtcgc gtccgcctga cggcgaaagg 11340 
gatcgctgag ctggagacgc tgcgcactca 11400 
acgtctgcgt gtggcctcca tgcagaccgt 11460 
tcgtctacgc cggcggcgtc caggactgga 11520 
ggctcttcag ggtctccagg tcgacgacta 11580 
gtggccggtc atctatccgg acgggctcac 11640 
cgtcgccgca gctcccattg tgggtccacc 11700 
gctggcgagt cgttcaccgc tctcggcgct 11760 
agtcgggccc gacgatggct gtggtgaagg 11820 
attgttggag caagaggctg atgacggtgg 11880 
agccgtgtgg gagtagcggg ctggctccga 11940 
cagcggccag tgccacgaac gagacaagcg 12000 
gcacgaccag ggtctgtccg agctgggcgg 12060 
cctggcgcag ggttccggtc gccctgccca 12120 
tgaccgcggc gcggcggtag atgtcgcggg 12180 
cttctgccag cagcatctcc gcagtgatcg 12240 
ccgggaggat gtcggtgagc aggttgacca 12300 
cgccgttgat cacaccggtg acgacggtga 12360 
ggatggccag cgagtcacgc accgatgccc 12420 
cgatggtctc gatgcggtca tcggtgacca 12480 
ttgcgcgctc gcccagcgcg atgccgatgt 12540 
cgccgtcccc ggtcaccgcg accacacggt 12600 
ccttctgcgc cgggctgacc ctcgcgaaca 12660 
ggtcgccgtc gagggtgtcc agttcggtgc 12720 
cgagttcggc ggcgatcgcc tcggcggtgc 12780 
cgacacctgc ctgctggagc tgcgcgacgg 12840 
ccagcgcgac gaaaccgatc agccgcagat 12900 
ggtccgaggc ggcccgctcc gcgacagcca 12960 
cgacaactcg ctcgacgcct cgacgagcgg 13020 
tgcgccaccg ggtgcactgg gccagcacga 13080 
tgccgtcgcg ccaggtgccg agcaccgcgt 13140 
cgtcgatcgg tacccagcgc tgcttgcctt 13200 
catcgacgac cgcgcgatcg gtgggatgcg 13260 
ttgcccgcaa ggcacccgcg agcactcgcc 13320 
ggctcgacgc gccatcggat acctgccgca 13380 
tcttgtcgaa acacaacacc tgcgcccgcc 13440 
ccagcgcccc gcgtttcgac agccgccggg 13500 
acggcaatcc ctcgggaacg gccgccaccg 13560 
ccaatggacg cctacgcagc atgtccgtga 13620 
tcggcaccgt caccttggcc agcgtctgca 13680 
gggcatcggg cccgatcgcg ctgcctgcct 13740 
ccaccgcccg cccggcagcg accgcggtcc 13800 
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cccggtagat catgcacgtc cggtccgcga ccggcaccgc cggagtcgcc gtcaccgtct 13860 
tggtgaccaa catcgactcc ccggtcaggc tcgactcatc gacctccagc ccgtcggcct 13920 
ccagaagccg gcagtcggcc ggaacagcat ccccggccac caactccacc acatcgccga 13980 
gcacgagctc gtcagccggc aactgcgtca cattttcctt acggcgcacc acaacccgaa 14040 
caacgctggc ctccagaagc ctgttcaaag cgcggtccgc ggtgatgcgc tgcacgccgc 14100 
cgatcaacgc gttcatcccc agcacgccca tgatgatcgt cgcgtccgca acggcaccga 14160 
tactcgccga gatcgcggcc ccacccgcca acgccggcgt caacgggctc gccagctcct 14220 
ccagcgacgc ccgcaacaca cccatccggc gaaggcccgt cccctcgtcc tgaccttgga 14280 
aacgctcgtg cgccgcatca tcggtcagac cctgcgatga actgccgagc agctccagca 14340 
ccatgccggg gcccatcgca tgccaggcag tgcgcggcac cgcgcgcgga accggacgcc 14400 
acgcggcctg catcccccac cacgtcccag cgccgagcgc gattgccgca cacacggcca 14460 
ccggaacgcc cgcacggacc tggctcccga tggcaggccc gaacgcgccg aacaccgccc 14520 
cgctgaccga accgatcaac gacagctgcg cgctacgcac gctgacataa cgcgcggtcg 14580 
ggaccgtact caacaacgca cacgactccc ccaggctcgc gcccgagatc aggtgcgcac 14640 
cccacggaac ctcaccacca cgcgtcacac caagcccgac atccgcagcc gccagtgccg 14700 
cccgcgcccg cgcggacacc acaccgacca cgctgccctc ccgctgcatc tcccgcacca 14760 
ccgaagccag ccgcacccca cccgggacca cgcgatcaac accgatccgc cccgccaacc 14820 
ggcccccatc acccgcgaga accaccgacc cgacctcgcg agctgccgcg accaactcct 14880 
cagcgagcgg atccagctcc tcggcagcca gaacccacgc ggccgtgcgc gaatcgtggc 14940 
gcagcgcgaa cacctcgata ccccgaccgg ccacctcgcg cacgtctggg ggcaaccgcg 15000 
accgagcgac cgagtacagc gaccaaccat cacgctgcgc gccgaccccc ttggtcacac 15060 
ccacgagttc atgcgcacgt tcccacaact cgacgaccgc gcgccccccg tccgcgggaa 15120 
tcacgtcatg caccaaccga gcaccggtgc gcagaacgtc cgcgtcgatc accaccgtgt 15180 
cgacccggtc caaccgccgc aacacccggg catcgaacac cagagtcgtc cgctccgaca 15240 
gcccgtaccc gagctgcgcc gcgaacgcgc cccgccccac ctgcgccgcg cgcggcgcgc 15300 
ccgcgaccac catcgcctgc gcacgctgaa aactccgcgt caccccgagc atgcccgcga 15360 
acgccgccgg cccacccaga gccgaaccgt tcgcggcgcg ctcgaccggg ccggcgggca 15420 
acggcacagg acgcggccga acccccaccg gcgcactacg gtgactggca ggatgcgccc 15480 
ccgttccggc ttcccaccgc gcccagctcc gccgcgccgc ctgcgtctcc cgcaacaggc 15540 
agaaccggta ccccgattcc gccagcaaac tcgccggacg ccgcgcaagc gtgttcccca 15600 
acgcactgct aaacccgaac aacaggtcag tggcggtctt cccgatccgc gtttccagcg 15660 
gcccccggac ccaggaggcc gactcaccgg tcgacaccag cgccggaaca gacggcggca 15720 
gcacgggcag cggcagcatc cgggccccca ccgcgatcac agcgccggtc aagttcgcac 15780 
ccagcaccgt cgcctcagcg atcaccttga ccggattgtc cgggtggctc acaccgaccg 15840 
cagcgaacgg ttcctcgtcc aaaccggtgt ccgcctccgc gtcctccacc acacccacca 15900 
gctcagcgac cccaaccagg cccggatcgt gcccgatgac cacccgcccc acgacagcgt 15960 
tgatctcgac ccacgcaaca cccggcacct tctccaaccg ccgggcgatc tgctcgccag 16020 
cctcctcagc ccccggctgg tcaacaccgc gcacctcgac gtacgcacgc ccctccgacg 16080 
accacgcccg ccgcccgcgc cccaagcccg cgatgctctg cgcgacagcc accaccggcg 16140 
ccacgatctg cgaggccaac gcgcccaagg tacgcagcat cgaacacact cagctgtccg 16200 
acgcaaagcc gaccgacgcc cgtgacaccc ccgcgccggt cgcccccgca accggccatt 16260 
cgagccccca accgacccgt accgcgcgaa ccgcgtcctc ccaacgaccc ggcatgcgaa 16320 
catcgacccg acgaaactgt gccacgacgg gcaggctgac cgccgcggct cgcaaccgcg 16380 
ccccttgttg ccgcgcgcct gcgccaggcg ttgtgttccg cgccgtccta gccaccacta 16440 
cctcccgcgc gcacagatac cgtgagtcac gagcgtagga ccgacgagcg gaaagcgaac 16500 
ctcgaaacag ccgcacatcc gtcctctcgg tggcctgcga gctgatcagc aacaccgacg 16560 
gcttcgccac gctgttcgac ccgccgacac accagtagtc ccagcggtgg tgaattcaag 16620 
gccgcctgtg tgctcgggtt cttcgaaacg gctccgagca tgcagctacg ccggacgcag 16680 
gcaacgcaac acctacccag gttcacctgg cctgagcgcc gccgcggcgc gctcagcgac 16740 
gcctagcccg gaacccgcta caaaccggat cacttcgacg ccggtccgcc ggcacgctgc 16800 
gcgaccgcga cacccacgcc aatcgccgcc gccaccggcc agtccatcaa ccccaacagg 16860 
gtcagcgctc caagaccacg atagaaggcc atcgtctgat ggtccgggag aacgccccga 16920 
acggcctcag ccgcgctagc aagatcatca cgagtgggaa tgtagtgatc gggacggtgg 16980 
aactcggccg ttacgcccgg cagactcacc gtcgttcccc cagcacccgc cggtttccgc 17040 
tgcgtagtaa ccacggcacc tcctcaacac aggccccatg ccgcataatc ggcctcgcac 17100 
ggactcaaaa ccgcccaaac cgaccaaccg ggcaacacga ttttatctct gaacacgaac 17160 
gtccccgcgc cgagacggct ggaatacggg ctgccgccgg acctgaacca gtcaggcgtt 17220 
gtgcttcaac aacgttgctg tgcacccacc tgctcgggtt ggccaaccgt gtggacgcaa 17280 
cgccggctcc agtgactgct ggtgatcaca ctcgttgatc atgatcgtct ggactgcggt 17340 
ttgatcaagg caatgttccc gcacctggcc ggggtgcggg agcggctggt gcgccggttc 17400 
gcggagttca ccaacgagta tccgtggcgg tggacaccgt cgcatgtgga cgaatggtcg 17460 
caatcaccga ccggagaacg gcatctcgcg ccatccacaa tccgcgccta ccaaggcagt 17520 
ctcaggctgt tcaacgaatt cctctgtgac agccggtatg gctgggcgct ggcgtgcgag 17580 
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gcgtcgttcg ggccggacga gtacccggtg 
catctccagt cctatgaggg cgatccggac 
cggttccttg actacgccac gaccaagtcg 
cattggcagc ctatcgagac gcgaccttgt 
ccggggccct acccgcgtgc gctcccgcga 
tgtgggagat gctagccgtc gcgacagatg 
agcggctcgc gtgcagtatg cggctcgcgc 
ctgcggatcc gtggaggtag ccggcgatga 
tgaacccgag ggcgtgaagc tgtgcggcga 
gttcgccggc gccggccagc cgatcggcac 
tggccgcagg ctgcaggtag tcgaagacca 
attcgagggt ggcgtgggcg gcgttcgcgg 
atactgccgg gtccgtccgt ttgaatccgg 
tttcgaaatc gaccggtacg aaggtcaacg 
ggcgttcgcg tttccatgct tgggtggcgg 
ggcgcggatt gcgataggcg aaggtgtcca 
caccatcggc gatggccgcg gccacggcgt 
acaggcggcg gggctgatcg attgccccgg 
atttggtcag ttcctctgcg gtgacgccga 
tctgtggtcg gtcagcgatc tggtgatagg 
ggctgggtcc gccgttctcc atgttcaaga 
gaactcactc acgcccgctt cacaaaggag 
ccctccttta gggcgattgc cacgagtcga 
tgggttgatg tgttgagccg ggtcatgagt 
cgccggcgcg ttcgtcgagt tgctgttgct 
tcaacggtca cggtgcgttc ggaccgcgtc 
accgggccct ccccattgat gtggccgctg 
cgccggttcc cgcagcccgc cttccccccg 
acgcgttcac ggggacgatc atcggtccac 
gcggctattc caggggcccg gccgtggttg 
tgggcgcggc atgatgagcg accgtgctgc 
cgtccgtgca ggtcacacag cagcgtctga 
tcacctcacg cagcgaatgg cgcgccagcc 
ccggtagctc ggctggaggt tgcgggatga 
gatgaccgtg cgactccgcg agcgttgcaa 
ggccgttccg cttgaccacc cgcccatcga 
tctgcgcaat gatcgcgcca caagggtcgt 
acatgtccag caatacgatg tcggcccgct 
cgacgcccgc ggcgcgggca ccgttgaccg 
accgcatgac cgtgggcagt gtcgaccgcg 
tcacctgtag gacgagccgt gcgtgcgaaa 
cgatgccgag gcttggcccc ggcgtgttct 
tgcccatctg catctcggtc tcggcgcaaa 
ggtgccattc gttctcagtg ctgaatgtcc 
tgttgtggga gtgcagcgcc tcgatgtcct 
agtggaacgt gatgcgtgcg ccgagctcgc 
cgaagggagc gatggctagt tcctgcgggg 
tcgggaagta ctgggcgcgg atttcccggg 
cttccaccga ctcgtccagc ccgcctttgg 
ggaggccgtg gccgtacagc gcgcgcacgc 
gcgcgtgctc gccggtgacg atgttgtggc 
taaggcaatc gaggccaccg aggtagttgc 
tcgccatctg cagccggaat cccctcaggt 
cccggagccc ggtttgccag gtgtgtcgat 
tcgacgaggc gtcgattcgc tcggtgtcgc 
tgacaccgtt gtcgacgagg atgtcggctc 
tcaccacggc agcgttctcg atcagtgtgc 
ggcatcggat gccgtcctca agcggtcccg 
ctgtggagag gggccgtctc gcagacgggg 
accggcggtt gtcagtcgca tcgtcgcgcg 
tgcgtccgag tggtgaagcc gagtttggtc 
cggcgggaga tcaccatgat ggcggcgatt 
gctgctacct ctcgctcgcg cggagtaagc 
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gcgaatgtgc acgagtggaa ctcactcgcc 17640 

gaccgcccgt tcacccgcga tgagctgcag 17700 

accgcgcggt tcgcgcgaaa cgcaacggcg 17760 

tcaaagtgat gtatcgctag ggcttgttcg 17820 

ttttttgagt ttgtgcagtt cagaggacga 17880 

tggtccaagt aggcgcggca ggcaaacctc 17940 

gcagtgcgcg gggtggttcg ccctcgaaac 18000 

ggtccggggc ggagtggtct tcgatgccgg 18060 

tgtcgtcggt tgtgaagtag ctgaacaagg 18120 

gtgcccgctg gtgcgcacgg tcctcgtcgg 18180 

cctcgaccgg ctgggcttga ccagcgatgt 18240 

tcagatagaa gacaacgccg agccacacga 18300 

cggattccaa tccagttgcc agcggctggg 18360 

tctcagggcg gtcgatccct gcgctagcaa 18420 

ggtggtcgac ctcgaagacg cgcaagtccg 18480 

ggcccgcgcc gaggatcacg acctgccgta 18540 

cctcggcgaa gcgggcacgg gcggcgaaga 18600 

cgccgaggtg atcggttgtg ggcgtggcca 18660 

gcagacgtac cgccaggggg tcggtgaaga 18720 

cgcgggcgta cgcggtagcg agcgctgtcc 18780 

acttactcac gaacgcggcg cgggctctat 18840 

cacgcgaaca cgatccgtaa ccacagctgg 18900 

caggcagggc tcgaaccctg ttatcggggc 18960 

tcgcggttgg tggccccggc gcggcaagat 19020 

gcggcaagaa acgacagctc ggccgaccgt 19080 

ctgccaccac acctgacggt ctttgtcgtc 19140 

atgctcgaat gcgcacgagt ggtcagtcaa 19200 

gaagacggcg cgttcccgca cggcagcccc 19260 

gaggcgaata cctgctccgt gcaggtggaa 19320 

ctcgaatcgg cgctcgcacg cggctgacgg 19380 

tgcgactggc cttgcctatg ccgataacct 19440 

cggttttccg gtcgagcccg gccagccggg 19500 

acagctggct tggttctcag cggttgaaca 19560 

atccgccgtg ctgggccatt tggtcgtaga 19620 

gcgctccgtt gacgtcgacg tgcaccagcc 19680 

ccagcaccgt gtgcacgttg cccgagttgg 19740 

ggcggttcca accggcctgg ctgatgcctg 19800 

tgcccggtgc cagggttcct gtcacgtcgt 19860 

tgagccaccg taaggcgtcg cgcgttttcc 19920 

cgtagtcttc ttgatcgtcc ctccatcggg 19980 

tcatgtcgcc gcaggtactg ctcgtgcagt 20040 

cggtggcctc acgaatgacc gggaatccca 20100 

ccgagatcat ggtgccggta ccccgcagca 20160 

cgtgcaccaa gagaaggtcg ctgccgagca 20220 

tgaacaggtt gcgcgcgatg acttggttgc 20280 

gggccagctc gaattcctgc ttgacgtccc 20340 

cgatgccgaa gcgcagcaat tgttgctcgc 20400 

ccagccgcgc ccgccgggcg aagtcgcctt 20460 

tctcggacca cgtgttcgac gtgatgggca 20520 

ccgcgtcgcg gagtccagcg accgcggcgt 20580 

agtaatcgac cagcgtggtc acccccgagt 20640 

ccgcatacat gtcctgcggg cggtacttcg 20700 

agtcgaggat gttcccgtcc gccagaatgc 20760 

gggtgtcgac catgcccggc atggcgatca 20820 

caacgttgag gttcggcccg atctcgacga 20880 

gttcgaagtc gccgagaacg tcgtccatcg 20940 

gcatggccaa gtgctcctct ccgcgcggtg 21000 

ttcggtcgtt gatcgactgt ggagaagagg 21060 

cagcttctac tggtccacgc caatccgaaa 21120 

gtcttctgtt gaatgaccca ggtcgcgatt 21180 

aggatgtgct cgacgtgggc ctcagcggtg 21240 

tctttgttgg tggtgccctg tgcgaccagc 21300 

accgacgcgt gcgtttcttc cgtcctggag 21360 
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cttgctggtt ccccggcttt gcccggattc 
agctgtgtgg cgtggtgaac gcctttctga 
actgcctcgt agtggtcgtg taggcgcgca 
acgttctttt tgatcgtttg ggacacgccg 
tctgcgctgg ctatgcaggc cagaatctca 
aagccgagcc gcaggcgaag actctccagt 
gctagtaggt ctgctattcc cttcgcccac 
taggtctcct tgacggcagg gcacccgcgg 
aggaacgcga ctgccaacgc gagctcgatc 
atcagccgat gcttggcgag agcgtcttcg 
ttgaacaggg cggtgagccc cgagatctgc 
aactccgccg ccagcgcgct cgcctgctcg 
agcgcggcat tcaggtaccc gtcgacgtag 
aatcgatggt aaccggacat atgcggctaa 
acgctaaccg gtatctcggg cgggtcgcgt 
cgtcttccgc tcgacgatct acaagtacgt 
ctctgggccc ggtccgcagc cagcgttcca 
ccgggacgct tcggccgccg tgggttcagc 
gggtccgtcc cgcccaggac caggactgga 
catttgatcg tggtttccag cagtaatgga 
ccggagagtg tcttggagca accgattccc 
gttctgttgc gacctgagcc cgccggcgcg 
agccattcct cgatgggcag cacgtgtgcg 
tccgggtcgg agacctcgac gtagcccgcg 
atcggcactg cgtaccgggc gtcgaggatc 
atcgcggcgg gcagcgggct cggcggctgg 
gcgggcagga acaccgcgtc gaacgggctg 
tggaacatcg tgtcgccgcc gtggaagacc 
agctgcgggt cgcccagccc gtcgacggcg 
tcgcgggtgg accaggcatc cacgacctcg 
tcggccggca gtgtcgtcac gttgtccacg 
accggtgccc ccggtgtcag cgcggccgcg 
aggtgggtga ccagtgcggc ggtgaccgtc 
ttccacgccg tgaacagcgg tgagaggtcc 
cccgcctcga tttccagtcc ggcccagccc 
ggtccgtggg tgaagcggaa tttagcgtac 
gtacgctatc ctcgtcacat gtcaccgcga 
agccggatcc tcgaccgtgc cgccgagatc 
atcggccggc tcgccgagga actggagatg 
accaaggaga cactgcagat ctccacgctg 
gtggtcgagc ccgcactggc cgagccgccg 
aacgccgtgg gctacctaga agcgccactg 
ctcaccgagt acgacggccg ccccggccgg 
cgctggcggg gacagctgcg ggcggacctg 
gccgggttcg acatcgacca ggcgctgttc 
gcggccatgc agctccagca cgaccgggcg 
cgggccttgg accagtcctg acgcccgcgc 
accagaccgt ctgaccgcgt tcctgctggt 
tacgccctag catggtcacg agacattccc 
aaagctactc gccggttgct ttgcaccttg 
cccgtgggtt cagtgtgagt gaaggttcct 
ggtctgaaga cttcgtgaag aaggtatgaa 
atgggaagcc aaggcttggg agcgtggaca 
ccaccatgat cttcaacctg gtagccccgg 
tgcggtacga cagccgcaat ccgtacgaga 
gtcaggtgga ctgggtgatc gcccgcgacc 
gcgaaggcga tgtgcggatc ggccctcgac 
tgagctcgcc gtcggggcag gcctccttcg 
tgaacgacac ctacgacgtg gtcgaacctg 
aggtgctgag ccagctgctc tcgcacaacc 
acgccaaagc gctggctgcg ggacctggcg 
gctccagctg gaccacgtcg gtgccgtgcg 
ttctccaggc gcgccatcgg cgcaggaagc 
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cgtatccatg cgatggcctc gtcgaagtcg 21420 
aacgtctcgt cgccgagtga gctacgtgct 21480 
acatgtttgt ggccggcgag cgacgcatcg 21540 
aacaaggtcg cagcccgtgc cgggtcaccg 21600 
acgcacaacg cgatctgaaa ctgctcatgg 21660 
tccgcagcat tggcttgccg gtaattgccc 21720 
agggcaagcg atctcatcca ctgttcgctg 21780 
aacagggcag cggcgcgctc atgatcacca 21840 
tgatcgtagg cggcggcagc gtgatcgcca 21900 
aagagcgcgc tcgcccgcgc cggatcttcg 21960 
gcgacgtagg ccgcccctga ttggtcgccc 22020 
agaagggctg acccggatga cgcgtcgttg 22080 
agagctttga cacgtaccgg atccgcatcg 22140 
tggtgcacaa cgggcagtgt tgatctgacc 22200 
gctgcgccgt tcaagctcaa gcgtgcggtg 22260 
ccccgaactc agcacccacc gagcgctcgc 22320 
gagctaagcg ctggtttccg gcgtctgcta 22380 
gcaagtgagg gttcctcggc gggaacccaa 22440 
cccgtgggtt cggctcaacc cggagtgtcg 22500 
tgctgaatct tgtgattgac gccgccgcga 22560 
gcgcgattcc ggaccatcca tcggacctag 22620 
gcggagtttc cgcctcaagt ggccaggtcc 22680 
cggtgtccgg cgtgcgtgcg gaactcattc 22740 
atcttgtccg gctgttccgc ctcgtagtgc 22800 
tctgcggctg cggcggcctg cctcgggtcc 22860 
aggtgtggcg cgtcgaccac cgcgccgttg 22920 
aaccggcgcg cgatgagcca ccagtagccg 22980 
cgctgcccgt cggcctgcac cacccagttc 23040 
gggaccgcgg tgacgcggaa cggcccgagg 23100 
gcagccagtc ggtgcagtgt cagctcgcgc 23160 
tcgtcgccgt ggccgggcgc gggtcggagc 23220 
agcgcggccg cgtcggtgtg gtcccggtgc 23280 
ccgctcggca ccgccagcct ctcgccgggt 23340 
cgtacgtagt cgatcaccag tcgcttgccg 23400 
agtcgtcgca cccgcatctc gcactccttc 23460 
gcacgttcac taaatcaata gcgaacgcgc 23520 
cgctcagcgg ccgaagcaca ggccaccagg 23580 
gcatccgagg aagggctgga cggcatcacc 23640 
agcaagtccg gggtgcacaa gcacttcggc 23700 
gacaaggcat tcgtggactt ctggcaccgg 23760 
ggtctgcggc ggctgcgcgc ggtgtgcgcc 23820 
ctgcccggcg gctgcctgat gaccgcggcg 23880 
gtccgcgacg cggtggccga ggtgtggtcg 23940 
accgcggcgg tggagaacgg cgagctgccc 24000 
gagatcgtcg ccaccgggct ggcactgaac 24060 
gccgccggcc gggcccgccg cgcgatcgaa 24120 
cagacaaacc cgcccgattc tttcccgagg 24180 
cagcccgtat ccatcccatt tttcggcgag 24240 
ggagccgtca ttgattgact ggcgcagcga 24300 
ggtcagcttc gggagttccg ggcccagagt 24360 
cgctgggagc ccgaggggga ctgccaccag 24420 
cccgggttgg gcatgcttcg ttatgagggt 24480 
tcatggcaaa cgatcacggc acggtgctcg 24540 
cgggtgtgat cgcgccagtc ggcgtggagc 24600 
tctccatgaa gctcaacgta ggcacggacg 24660 
tgctggccga cgggctgatc gccgaggcag 24720 
ggggttttcc ggggttggtc gtgatcgaga 24780 
aggtgaatgc tgaccagctt gcggacttct 24840 
gtgatgaaca ccggtggatg aacgtcgacg 24900 
tgtaatggcc cagctctccc gaagcgccgc 24960 
gcgctgaaca ccgccacgct gtgtctccga 25020 
cccggctcgg tcaggccgaa ggtgctgatc 25080 
gctgcttctg ctcccgccgc agtaccgtcg 25140 
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tgtcatggcc acggacagct tcgattcctc 
cgcgtcgtgc gtctcggaag tcagccgaga 
gcagttgcga gccgatgaga tcgtcctcag 
cgccacattg ccgcacgttc tcgaattcga 
tgcggtgttg gagtatccgc gacgactggc 
ccaaccgcat cgggcacttc cgcttcctgt 
gcgccgacga ggccgtatcc gcagcgacca 
acatcgaccg cgtcacggag cagatcgccg 
tcctggcgac gggacggccg tgaggtctct 
aggcacatga ggtgttgcgc gcggtcgttt 
cgggcaattg agggagcgaa gatttagagg 
gctcgggagc ggctgtgggg cggccaggcg 
accggtacgg cggcgaggcc ttcgagcagg 
gtagtgcggc cgagtgcgat catttggaac 
gggccatctg gatagacgcc gtcggggccg 
gagctgacat ccgccatccg tacatggggc 
atggcgactt gttgctcggt gttcagaatt 
ggttgatgca acagcgccac gtcggcccgg 
tcgcagagca gcaggtcgac ggccacggct 
ttcgccagca gctcaccgga ggcgccggcc 
gtcgcggcac gctgggtgcg tcgctcggct 
tcggtcagca gcattgcccc ggcttcggtg 
aacacgactc cgagtcgttg ctcgagctgg 
atgcccaggc gctgggcggc ccggccgaag 
cgcaactccc gcgtctccat ccgtcgagcc 
tgtgagacct agatggtgtt ggttccccgc 
gaacagacga ttgcactggt caccggcgca 
gggctcggcg cgctggggtg gagcgtcgga 
gatgccgtgg cgaaattgcg tgcggacggc 
acagacgacg cgagcgtcgc ggctgctgcg 
gatgtgctgg ttaataacgc cggcatcgcc 
acaccggcga gcctccgggc ggtggtggag 
aacgctatgc tgccgttgct acgccgctcc 
cacgtcgctt ccctgacctt gcaaaccacg 
gcctactcac cgtcgaagac gttcctcaac 
agcgatacca acatcaaaat caacaacgcc 
ggcttccacg gaaccagcac gccggcagac 
ctgccagacg acggcccgac cggaggcatg 
ggcgctcagt cggcgatggt gcaatcgaag 
gaacaacacc tgttcctgtg ggtacggatg 
ctgtacttcg ggcgccgtta ccgccggtgc 
cgtggctcac cggcggctta ggtcaggcgt 
tgcgtaggtc gggtaggcgc atccggcgcg 
gtgctttggg gctgctcagg agccgaacac 
gtggttctcg tcggctcgga tcacaacgtc 
gattcgaggc gggatcgtcc tcggcgacct 
tcgtcgacct ggttggcacc tagatcacga 
cgaagtgatc ccgggtcacg tcggcttatc 
aggtcttgcg tcgttgttcc gggctcagtt 
cggtgctcca ggaaggtatt ccgcgcggat 
gtcctcagtg aacagcgaga ggatttcgaa 
ggcgactgtg tagcggacgg tcatccggct 
ccggctgatc acccgctcgc cgtggtgcgc 
cttgtcggga aaccaccagg gttcgatgac 
catgttccgc gtcacgcgtc gcaggtcgtc 
caggcaggtg atgacgtcga atggctcgcc 
aatcggtacc tcagggactc gtctgatcgc 
ccccgcgacc ttcgcgtatt cggcacggaa 
gacgtcgagt agggactgtg cttcgggcag 
ggcctcggct gcccagtccc ggccacgcgc 
atctgggctg tataccgagg aggtttctgc 
tttcggagtg gtgatctttg gctgatgtgg 
accgtgcgca cctcggcgtc gtcaaggctt 



LeA33955-US 

gaagctacag gcggccgtgg catcgagcgt 25200 

cgtctacacg cacctgatta ccgaggctcc 25260 

cattctacgg acgagtgttg aggaaaatat 25320 

gattccgttg ggatattcgc cgggtcctgc 25380 

gaaacatttc catcaacgcg ctgatcaggg 25440 

agtgatgcct cgacgagatc cgccgccaat 25500 

cgcaacgaat gctcgcaacc agcttcggct 25560 

aaacctacca gctcgaacgg gaccgctggc 25620 

gcggcatccg catagcgtct tctcccgctg 25680 

ccggcagtcg cacggcattc gtcctagctg 25740 

agtgtggcca cgcggaccaa gccggcgagt 25800 

atgactgtcg tcacgtccgg cgcgtctaga 25860 

ttgacgcgac tggattcggg catgaccacg 25920 

agttgcgtct ggttgcgtac ttccacgccg 25980 

ggccagcgcg caagcgggag atccggcagt 26040 

tcgctggcaa gcggatgcga ggtcggaaga 26100 

tcgatgtcga gttcggccgt cgggtcgaag 26160 

ccgtcatgca gcgttttctg gggctgggat 26220 

cccggctcgg cggcgtacgc gtcgagcaac 26280 

ttggcagcca ggactagcga gggctggctc 26340 

gctgccagcg cgccgaggat cgcccggcct 26400 

agcgagactt tgcggctggt gcgttgcagc 26460 

gcgatcgtcc gcgacagcgg cggctgggcg 26520 

tgcaactcct cggcgactgc aacgaagtac 26580 

taccgctgat tcatatcagc tgggtatcgg 26640 

cggtttcggg ccacgctaga aagcatgagc 26700 

aacaagggaa tcggatacga gatcgcggcc 26760 

atcggggcac gggaccacca gcgcggggag 26820 

gtcgatgcgt tcgcggtatc cctggacgtg 26880 

gctctgctcg aggagcgcgc cggccggctc 26940 

ggggcatggc cggaggagcc ctcgaccgtc 27000 

accaacgtga tcggcgtcgt tcgggttacc 27060 

gagcgcccgc ggatcgtcaa ccagtccagc 27120 

ccgggcgtcg acctcggcgg gatcagcgga 27180 

gcgatcacca tccagtacgc caaggaactc 27240 

tgccccggct acgtcgcgac cgaccttaac 27300 

ggtgccagga tcgccattcg gctcgccacg 27360 

ttcgacgacg ccgggaatgt gccctggtga 27420 

tcggagaggc tcgctgcgac cgggtacgcc 27480 

tcggccttcg ccgtctcggt cattgacaac 27540 

gccgcggttg cctggcgaca ctgggccagc 27600 

gggcggttgc cagcatggcg ggtgcggctt 27660 

ggagccggtc gagttcttcg ccgatggccg 27720 

ctcccagccg caggtgccgg gctgaaccga 27780 

tgccggaaca gctgcggcga ggtggtcgca 27840 

tgccgacgat cgcggctagg gcccagggct 27900 

cggtcaaaac ttgccggcat cagagacgat 27960 

ggtcgagtga gtcccggggc ctgcccagcc 28020 

gcggattccg acgaacaggc ctcggccgtt 28080 

ccctgcgtct tcgagcgcgg cggtgtactc 28140 

ctctgtgaag tcccggatcc cggtgggttc 28200 

cgtacggccc tccaggaccg agtgcgatag 28260 

gacggctccg gtgacgaacc cgtcgatgaa 28320 

cgcgactcca ccaggggcca ggtgccgggc 28380 

aacggtccgc atgtaagccg cggtaaagca 28440 

gaggtcgaaa tcgcggatgt caccgatgtg 28500 

gatctcccgc atcgcatcgg acagttcaag 28560 

tcgctctagg tgcgccccgg tcccacaggc 28620 

cctggtgcgt acgagctgga ctacttcccc 28680 

ggagtggatc gcgtcgtaga tgtcggcatg 28740 

gaatgtgtcg ctcacgcgcg acatcctcac 28800 

tgttcgacgg ccttctggaa ctcgtcagcc 28860 

gggtgcagtg gtagcaggag tgttctgcgg 28920 
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caggcgtgct ccgcagaagg cagcttgcag tccgcgcggt agatggggac cttgtgcagg 28980 

ggcgggtagc ggtagctcgt gtagatgccg cgttccagca tttgctgcgc cacctggtcg 29040 

cggatctccg gagccagctg gacccagtag aagtagtgtg acgagacgtg cccatccggt 29100 

agcgtcggcg gtaggaggac acccggcaca tcggaaagca accggtcgta ctgcgtagcg 29160 

atttctctac gcctgttgat gaattctggc agtttgcgca gctgcacgct gccaagcgct 29220 

gccgtcatgt cgttcccgat cagccgctgg ccgatgtctt cgacgcgaat atcccaccag 29280 

cggttggaag acttggccga atcgaatccg ctcatctgct caagaccgtg gtaggcgagt 29340 

cgtcttgcgc ggtgcgccag ctccggatcc gccgcgtaga acatgccccc atccccggtg 29400 

accaggatct tcatcgcatc gaaactccac gtggccaggt caccaaaggt tccgcaagcg 29460 

gtgccgtgca cggacgatgc caccgcgcag gcggagtcct cgatgagcat gaggcccttt 29520 

tcacggcaga aatcggcgat cgcggtgact tctcccggcg atcctccata gtggagcagc 29580 

aatacggcct tggtcgccgg cgtgatggcc ctcgccacat catccagcgt ggggttcaac 29640 

gtccgggggt cgacgtcgca gaacaccggg cgggcaccgg aggatgcgat ggcgttggcc 29700 

gccgccacga agcttatcga aggaagtacc acgtcgtcgc ctgggccgag gtcgagcacc 29760 

tgcacggtaa ggaacagcgc ggcagtcccc gagttgagga acacgacctg ttcgggatcc 29820 

actcccaggt ggtgggcgaa ttcggcctcg aacgtccggg tgcgcggccc gagcccgatc 29880 

cagttggagg cgaacacctc cgcgatcgcg tcgagttctt cggtgccgag gatcggctgg 29940 

tgcaggttga tcacgttgct gaaatcctcc gagatgccgc catgctggat gctaggaact 30000 

cttggccacg aattcagcga ttgattcgac gacgtagtcg atcatttggt ccgttatgcc 30060 

tgggtagacg ccgacccaga aggttcggtc ggtgacgatg tcgctgttgg tgagcgcgtc 30120 

ggcgatccgg taccgcacct gctcgaaggc cgggtgccgg gtgatgttac cgccgaacag 30180 

cagtcgggtg ccgatgttgc gggattccag gaagttcacc agggcggcac gggtgaaccc 30240 

ggcgtccgca ctgatggtga tcgcaaaccc gaaccagctc gggtcgctgt gcggtgtggc 30300 

taccggcagc agcaggcccg gcaacccgga cagcccttcg cgcaaccgtc gccagttacg 30360 

gcggcgtgcc gacccgaatg cggaaatctt gctcaactgg ctcagcgcaa gtgcggcctg 30420 

caggtcggtg gtcttgaggt tgtaaccgac gtgggagaac gtgtacttgt ggtcgtagcc 30480 

cggtggaagg gtaccgaggt ggtagtcgaa cctcttgcgg caggtgttgt ccacgccggg 30540 

ctcgcaccag caatcccgtc cccagtcacg cagcgactcg atgatgcgag ccaattccag 30600 

gctgccggtc aacacgcagc caccctcgcc gctggtgatg tgatgggcag gatagaagct 30660 

gaccgttgtc aggtcgccga aggttccggt cagccgtccc cggtaggtgg atcccaccgc 30720 

atcacagttg tcttcgacga ggaacagctc gtgttctttt gcgatctccg cgatttcgtc 30780 

agcggcgaag gggttgccca gggtgtgcgc cagcatgatg gctcgcgtcc gttccgtgac 30840 

ggcggccttg atgcggtctg gcgttgcgtt gtaggtgccc agttccacgt cgacgaatac 30900 

cgggacgagt ccgttttgga ccgccggatt gatcgtcgtg gggaagccga ccgccgcagt 30960 

gatcacttcg tcgccgggcc gcagtcgtgc ctcgccgagt ttgggggagg taagcgaact 31020 

cagtgccagg agattggccg acgaaccgga gttgacgaga tgagccttgc ggaggccgaa 31080 

gaagcgggcg aactcgctct cgaatcgccg tgcattcccg cccgcggcga tccggagctc 31140 

cagcgcggct tccaccagtg ccacccggtc gtcctcgtcg agcacggcgc ccgatggccg 31200 

gatcggcgtc gatccagcca cgaaggtcgg ggattcctgt tcgcggtggt aatcgcgtac 31260 

ggatgccaat atccggtcct tggcatccgg caccatctca gtagcggtag cgcaagtgtc 31320 

gtcacacgaa gtcactctgg cgcgcccttt ccccagcgct ctggttttcc ggctctgcat 31380 

gcaggcgacg atcagtcttc gcgccttgcc ttcaggagat gagcgatgcc cgtggcgaat 31440 

cgcgttatga cgtcccagcg ggacagtgtg ctgtctcggc gccttacacc ttcctgccct 31500 

ggttcgatgc ggtgcgggac atcaggacag cggagcaagg agaagcgctc attgactcag 31560 

aaatcctcga tctacccggc acacccgact cggtagagcc caggctagcg ggaacgacct 31620 

gctcgcgctt gtcaagatcg ctaccatcac ctggaaggcc taagatttgg cttgcgaaag 31680 

cggcgtttcc cgggggatat cagagatttc tgtgattctt ggcatgcttc ccgggtgttc 31740 

aattgcgatc ggagagttca tgcgtgtcct gttcaccccg ctgccggcga gttcgcactt 31800 

cttcaacctg gtgccgttgg cgtgggcgtt gcgtgccgcg gggcacgagg tccgtgtcgc 31860 

catctgcccg aatatggtgt cgatggtcac cggagcagga ctcaccgcgg ttcccgtcgg 31920 

cgacgagctc gacctcatct ccttggcggc caagaacgaa ctcgttctcg gcagcggggt 31980 

ctcgttcgac gagaaggggc ggcatccgga actcttcgac gagctgctgt caatcaactc 32040 

cggcagagac acggacgccg tggagcaact ccaccttgtg gatgaccgat cgctggacga 32100 

tctcatgggg ttcgccgaga aatggcagcc tgatctcgtt gtgtgggacg ctatggtgtg 32160 

ttcggggcca gttgtggcgc gagcgctcgg cgcacgacac gtgcggatgc tcgtcgccct 32220 

cgatgtgtcg gggtggctgc ggtccggttt cctcgaatac caggaatcga agccgcctga 32280 

gcagcgcgtc gacccgctcg ggacgtggct gggagcgaag ctcgccaagt tcggagccac 32340 

gttcgatgaa gagatcgtga cgggccaagc gaccatagat ccgattccat cctggatgcg 32400 

cctgcctgtg gacttggact acatctcgat gcgtttcgtg ccgtacaacg gtccggcggt 32460 

gttgccggag tggttgcgcg aacgaccgac gaagccgcgc gtctgcatca cgcgcgggct 32520 

gaccaagcgg cggctgagca gggtgaccga acagtacggg gagcaaagtg accaggaaca 32580 

agcaatggtg gaaaggttgt tgcgcggcgc ggccaggctc gacgtcgagg tgatcgccac 32640 

cttgtctgac gacgaagtac gggagatggg ggagttgccc tcgaacgtcc gggtccacga 32700 

Page 9 



atacgtaccg ctcaacgaac tgctggagtc 
gacgacgcag gaaaccgcca cggtcaacgg 
ctgggacgaa tctcgtaggg cggagctcct 
ccccgcgacg tttaccgaag acgacgtgcg 
gtcgttcgct gccaacgcgg cgctgatccg 
cgacatcgtt ccacgtctgg aaaagctagt 
gtctgatggc catccgtgag caacgtgtgg 
tgttcatcgc tgttgcgtcg actcggattc 
ccgaagtcag attcttgtcc agaatcgtcc 
gctccaaccg atttccgacg aggatcgtgg 
gagacatacc gctgtgcgcc aggagcgaag 
gatgccgagc cgaaggtcac cttggatgag 
gtggccgacg tccgtgtctt ctccggcggg 
gtgctcgcgg tccttacggg cggatcgctt 
ccgcgacgcc cctgggagcc ggcgtgcaca 
cagttctgtc gaagctgagg caagtgctgc 
gttcgtcgac tctgcgctga gcgcttgcaa 
ctggctcgcc gatcggctgg gcgagaacag 
cctgtcgaaa tggaaattcg atgccagcac 
cttcacggta gaaggcctgc aggtcgagac 
gccgatcatc aaccaggctg aagtaggtat 
cgtgctgcac tgcctcatgt cagcaaagat 
ctcgccgacg gttcaggcaa ctcggagcaa 
gccctatgtg gactacttcc tcgggcgggg 
gtctgaacag gggtcctggt tctaccggaa 
ggaggaagtg ccagtcctgc cagacttctg 
ccttcgtcag gacaacatcg tcaacatgga 
ccacgattcc gccaccggac ccgaactagc 
ggccaggtcg ctctcgcacg gcatcgattc 
cgaggaagcc aaggcccgct accgcttgcg 
caagtggtat cgcaccgata ccgagatcgc 
cgcggtgtcg gtgtccgcga ccaatcgtga 
accgcgagaa caaggtgaga tcgcactgtt 
tttggtccac gctcgggtgg aggctgggta 
ccagtgcagt gtggccaact accaaagcac 
cgacgtgctc accgccgatc ccgaaaccgt 
cggtcggttc taccaggcgc agaacaggta 
ggcacgacct cccagcgact tccggtggat 
gagcacccac ttcttgaaca tccaggcgcg 
ggcgttgggg cgatgaccag ctcgatgcga 
gcttccttcg cgtggcgacg gatgctgccc 
gtggcggtgg cgagccgtga tccggcgaaa 
gaggcggtgc tgggttacca gcggctcctg 
ccgttgccgc ctggcatgca tgcagagtgg 
gtgcttgcgg agaaaccgct gacgacgacg 
gccaggagga agaacctgct gctgcgggag 
gacgtggtcc gcgacctgct gcaatccggg 
gtgttcggaa ttccgccgct tcccgacacg 
ggagcgttgc tggacatcgg tgtctatccc 
ccgctcacgg ttctcggcgc aagctcgcac 
ggcagcgtgc tgctccaatc ggaaggtggc 
caccactacc gcagcgcgta cgagctgtgg 
gcgttcacgc cgcccgccga gtggcaggcc 
gacgagttgt ccttgccagc ggaagatcag 
gacatcagag cagggacagg cgtggacgac 
atgatccagc aggccgcgct ggtggaggcg 
tagccgcccg gcatccgcgg gtagtagttc 
cagcggggaa gccgctggag aggctcaccg 
ctgatcgcgg acggctcgga gaagtgctcg 
gccagggcgg cgggaaagtg agccaatccg 
gccgcgcggt agctgtcccg gagtcgctcg 
gtgctgtggt agacgagcgt ggggaccacc 
gccagtcgtg cgcgccatcg aggttgctcg 
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gtgttcagtg atcatccatc atggctcgac 32760 
cgtaccgcag ttgattctcc ctgggacctt 32820 
agccgatcgg ggagccggtc tggtcctcga 32880 
aggtcagctg gcccgcctgc tcgacgagcc 32940 
ccgtgaaatc gaggaaagtc ccagcccgca 33000 
tgccgaacgt gagaaccgcc gcactgggca 33060 
ccggaaacat ggacgccggg gtttggcagg 33120 
cgccgtgacc gggacgatgc caggcgagtc 33180 
aatggggtgt tgatctcccc agaggtttgc 33240 
cgcccgctga gcaacgacta ccgtgcggtc 33300 
gtgggttgcc cgatcaccgt gctggtggta 33360 
gcggaagcct ggcgagagca caccgaggcc 33420 
catttcttca tgaccgaacg ccaggacgag 33480 
cgatgatcct cgccaggccg ctggaccaga 33540 
tcgtcacggc agtgagggat tgggcatgag 33600 
tgcgccgctc ggcagcaaca acacgcggcg 33660 
tggcatgatt ccgaccacgg agttccactg 33720 
cttcgagacc aatcgcatcc cgttcgaccg 33780 
ggagaacctg gttcatgccg acggtaggtt 33840 
caactatggc gcggcaccca gctggcacca 33900 
cctcggcatt ctcgtcaagg agatcgacgg 33960 
ggaaccgggc aacgtcaacg tcctgcagct 34020 
ctacacgcag gcacaccgtg gcagcgttcc 34080 
ccgcggccgc gtgctggtag acgtgctcca 34140 
gcgcaaccgg aacatggtgg tggaagtcca 34200 
ctggttgacg ctcggccagg tgctggctct 34260 
cacccggacg gtgctgtctt gcatcccgtt 34320 
cgcctcggag gagcccttcc gacaggcggt 34380 
gtcgagtatc tccgaggcgg tcggttggtt 34440 
ggcaacgcgc gttccgctga gcagggtcga 34500 
ccaccaggac ggcaagtact tcgcggtgat 34560 
ggtcgccagc tggacgcagc cgatgatcga 34620 
ggtcaagcgg atcggcggag tgctgcacgg 34680 
taagtggact gcggaaatcg ctcccacggt 34740 
cccgtcgaac gactggccgc cgttcttgga 34800 
gcggtacgaa tcgatcctgt ccgaagaagg 34860 
ccggatcatc gaggtgcatg aggacttcgc 34920 
gactttggga cagttgggcg agctgctccg 34980 
cagcttggtc gcctccctgc atagcttgtg 35040 
aagccggtgc gcatcggtgt gctcgggtgc 35100 
gcgatgtgcg acgtggccga aacagaggtg 35160 
gccgaacggt tcgcagcgcg attcgaatgc 35220 
gagcggccgg acatcgatgc cgtctacgtg 35280 
atcggcaagg cgcttgaggc agacaaacac 35340 
gcgtccgaca ccgctcgcct ggtcgggctg 35400 
aattacctgt tcctccacca cggccggcac 35460 
gagatcggtg agctccggga gttcaccgcc 35520 
gacatccgct atcgcaccga actcggtggc 35580 
gcccgtgccg ctcggcactt tctcctcggt 35640 
gaggcccagg agtcgggcgt cgacttgtcg 35700 
accgttgccc acctcggata cggtttcgtg 35760 
gggagtcgtg ggcgaatcgt cgtcgaccgg 35820 
gtgatccgaa tcgagcggaa gggcgttgtc 35880 
gttcgcaagg cggtcaccgc cttcgcacgc 35940 
cctgcggtgg ccggagattc gggcgaatcg 36000 
atcggtcagg cccgtcggtg cgggtccaca 36060 
gcctcgaagc ctgaccgggc atccggaagc 36120 
ccatccgctc acctggcatc tcgcggaccg 36180 
tcgaaccacg agacgaccac tcgcgagctg 36240 
gagagcggat gccaccgcac tggcgtaccc 36300 
ccgaatgcga acggaacgat ctcgtcgtcc 36360 
gggccaccgt tcctacctgc gacgctttcg 36420 
aaaaggccgg aagtgtcgag gaagtcgctc 36480 
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agctcgcggc cgaggaagcg ggtgacgagc 
acggcggtac gacccgcttc ggtgagaagc 
gcatgcccga ccaggccggc cagcaccggc 
atgtccagca gatcgatcgg caccgcacct 
gcgtaggtgg ggtgcagttc gccggcgaag 
cagatgccga ccgggcagtc ggtcgtcagg 
gcggcatcga gcatggcgtg tccctgcgcc 
gtaccgaggc cttcgtagtc ggtgatgacc 
atcagctcgg tctccggctc ggttccggtt 
ccgaggccgt gggtgcccac tgcgaaagtg 
atgttcggca ccagaacggt gccggagacg 
cggtagagga tttgccaggc cttggctgcg 
acgggccggg ccctgaggag cgtgcccggg 
tagaagggat catccgcggg tgcccgcaga 
gccatcagga ctgcttcttt cgagcctgca 
tggcgtaatc cggatgtttc cggtattccg 
gtccgtgctt cgccgcgatg tctcccaagt 
gctcgacgcc ggcgagctca tcgagtttcc 
tgcacctggt ctgtgccgtg aggtcggtga 
ccagcgccgt gaagctgttc aaggtatggg 
cttccgggag tcgaacccga gtgatgatgt 
ggattcggtg gacctcgcgg atcgcggcgg 
tcgcccatgc ggcctgaaag aaaccgtccg 
cgaactcaag actgccggcc agtccggtct 
gctgggcgtt cacggtgatt ccggtgactc 
cgggctgccc attgccgcag cccaggtcga 
tgtcgatgaa caggtcggtc agttggtcgg 
cctcccgata cccgcccgcc cagtaaccgt 
cgaagatgga ctccacctga tccgcggtcg 
tcactcgtcc tccgcgctgt tcacgtcggc 
cacccaagca ggaacgccgc cttcggcgtt 
gtaggtgtgg aggccgacca aattgcgttc 
cggctgctcc tcgtgctgaa cgccttccaa 
gggtgcggcc gggcactgcg cctgcccgcc 
ctcgatcacg tatacgccac cgctgcgcaa 
cacgtgtccg ttgatgtggc tgccatcgtc 
gtcaacggcg gccagctcct cgggcttgct 
ctggtcgagg aaggacttgt cgaaaacgtc 
gcgcttccac atcttcaggg attcgccgcc 
gatctccagg atgcgcaccg ggcgatcacg 
ggtgaaccag tgcaggccgc cccacttgtc 
gtcgggacgt cggtgcccgc agccggcgac 
cagttccgac ggaccgggta tcgaaccgga 
ggcgccgacg gccctgggag ctcctgggcc 
gtaaccgatc cgcagcggga cgtctccgac 
ggactgtagc gtcaggacga agtcaacggt 
ctcggtccgt tcgatgtcgg gggccggttc 
gggtgcgatc ctggcgagtc cgagttcggt 
tccggcggcg aggatgatgc gttccacggt 
gagctcctca tggctgaccg ggtgaaagcc 
ggaagatgtt ctgcggatcc caccgcgctt 
tgtagtacag gtcgtgccag gcaacaccgg 
tgtccgggta gttgatgtag gagccgtcga 
tttcggcgta catctcgcgg tagaaaccgc 
cgggctccga ccagttcgtg acgaacagcg 
cggtggccga cggagccacg gcattcgcca 
ccgcagggtt gtcgtatccg tagacggtca 
cggacagccc ggtgcgcaag taggcggctt 
cgccttcggc gatcgccccg gccacctggg 
gcccttcggc tggagtcacg ccgacctggg 
tgcgttccgc gttcggatcc gtgccgtcca 
gccggtgtgc gcagacgagc gtgctgaaca 
gctgctcgta ccaattgccg aagttctgta 
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tccggtgcac cgagctcgcg cacttgatca 36540 
tcgtcgaatg gcagatcggg gtaggcagcg 36600 
ccggtgaaca ccccgtcatt tcggtggatg 36660 
gcggccgcag cgcggattcg cagttcaggt 36720 
gccgacgctt gcccaccctg cgcatagccc 36780 
ccggagcccg gtagccgttg cgcagcgcgg 36840 
ctgccgacgg tgtaggtgtg ggttccagga 36900 
acggcccacc cgcggtcgag ggccacggcg 36960 
cgaagcaggt acgacggggc aacttggcta 37020 
atgatggggc gatcttcgcg cggccacggg 37080 
gcgttcggca tgccaagggc ggagttggac 37140 
acgggttcgc ccgtgccgcg cagtgccgag 37200 
acacccggcg gtagcggcgt cggcggtcgg 37260 
tcgtcgccga ccaggctggc gtgctcggag 37320 
ggagcatgaa acccatgctt tcctcgtttc 37380 
caaccgcggc gatcagctgt gctggtcccg 37440 
agcgttgctg gtaggtgccg acagccgcag 37500 
ggagcaactc gtcgacgtac caggagacca 37560 
cttcgagaat ctcgaacccg gcttcgctga 37620 
cggtcgtgcc cgtccaaacc gccgcgtact 37680 
ctccgaggac gaaccggccg ccgggttcca 37740 
cctggtccac gatctgcacg acggactgca 37800 
ggtagggcag ctgggcgccg tcgactagat 37860 
cgttggcgag cctggtggcg gcggcgagat 37920 
gaacgccgct ggcgcatgcc gcacggacta 37980 
acaggtgcgc tccgggacgg agcgcggcct 38040 
cagcatccga ccacggtgtg gcaccggcat 38100 
ggtgcagggg acgcccgtgc gccaacgcat 38160 
gaaatgcctg tgtgttcgcc cctctgctgt 38220 
caggtgcaat atgtcgtcca gactccttgg 38280 
gacgcctttc tccaggaacg cgatgttgtg 38340 
caggtagctc ggctcgtacg agcccgcatg 38400 
caggttcttg agcaggctga ccgtggtgcc 38460 
gaatccggga gcataggtcg tccacagatc 38520 
ccgggggaac agcgtttcca gggatgtgcg 38580 
gatgatgatg tcgaacggtc cgtacttgtc 38640 
ctggtcggcg cggacggtgc agagcctctg 38700 
catcccgaac acgaggccgc ggtggaagta 38760 
gccaccgtcg aagttgtagc caccgacacc 38820 
gaactcgccg aggtgtcgct cgtatagcgg 38880 
cgtgcggtag tgggaggcga gcaagttgag 38940 
cactgcggag atggcctgga agccatcgga 39000 
tgtggtggtt cggaggaagt tggtgctccg 39060 
gaacaactcg gcgatgagat cggtgagctc 39120 
cggtcgttgc tcggccttga tcagctcacc 39180 
ctcgcctcgg tgggtgatct ggaccgcgac 39240 
cgcgcggaag aggatctcgt cgatcagcac 39300 
ggtcaggtcg gccaggctcg ccgcactgga 39360 
ttcgatctcg tgcgttgtgg acatcgtgat 39420 
gtgccggcgg tttgatcgac aggccgtgct 39480 
tggcccgctg cagccgcggg tagttgtctt 39540 
aggtgttcca caatggatcg gccaagtcgg 39600 
cacgggtacc tggcaccgga actccgccgg 39660 
gaatccaggt cagatgccgc tcgtcctcgg 39720 
ctttgagaac cgagtcgcgc tgagcgagtg 39780 
taccgccgta accgagcagc aacagcgccg 39840 
gccgccggta aaccgtggct agttgagctt 39900 
tgaccttggt ccgttgcatg cccggttcgc 39960 
tcgatcgcaa ccacggcagg gtttcccgca 40020 
cgttgatcgc cgacaggtgt tcggccaggg 40080 
ggtgaacgtt cagcgtgacg tagccagctt 40140 
acccgagttg cgtggattca ggcgcgctgt 40200 
ggagcacggc gaatgactgc tctgtcagtt 40260 
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cgtgccacgg ccagtggaac gatcggagca 
ctgcggcgtc ggtgctgacc acgtccggcg 
agttgccgcc accgccaccg gtgtgcgccc 
tccggtcggc ctcgacgatg tgcacttcac 
cttgaaggta gtcgacgacc gaaccgaatc 
ggatgtgccc gcctgcgccc accccgggac 
tcttgaacag ggttcggtac acctgcccga 
cgcgcgtgct gtcgtagtac acgcggttga 
ccgggtccgc aacgagattc tcgaagcagt 
cggtgcgcac ggcgtcggcg acggcgtgca 
tgcgttcggg ttttccggtg aaacgggggt 
gatcgtcggg ctcgacggtg atctctgttc 
catggaattc gggcaccgcc cctccttttc 
agatcatcta cgcgtccggg tgattatctg 
gcctccgccg gctctacttt gtggatcgat 
gaaccgaaaa ggtgatgact ttccatgagt 
gagcgtttgc tgctcgcggc gggtgcgggc 
gccggactgg acgcggtggc cgacgccatc 
ctgtcgttgg acgagtcggt gcgaatcggc 
cggagaaccg ttgagctcga tcacgcaggc 
gctgttctcc ggttcgacgc ggtggatctg 
aggcggcaca acagccgtga agtccgctgg 
cagggcctcg ccggcgcgca gcgcctgctg 
cacgccgtgc tggccgcagc cgccaccagg 
tacggatccg acaaatgggc ggacctgcac 
tcccgattcc aggatgcccc ggtgcgagtg 
cccgaactcg gtggtgcttc gctgcgcatg 
tacgggctgg acattttcga gaaagccggg 
ggtgaccaga gcgatgcgga attcctggaa 
attgtcatcg acgacggcag ccatgtcaac 
tttccgcacg tccgcccagg tggtttgtac 
cccggctacg gcggtcgcga tggggaaccc 
aaagaactga tcgacggcct gcattatcag 
tcctacacgg aacggaacgt ggcggccctg 
aaagggctca acgctgagcc tgccgcgccg 
gaggacggct gagccgttca ccagctgcgg 
tggatgggtt ccgtgatccc gagttccgcg 
ggcagaaaat agtcgtcgat gatgacgaat 
cgcaatgagt ccattgtgga ttcgtagagg 
tggatggggg cggtgggcaa ggtgtcccgg 
tcgagcagcc cgtagcgggc gaagttctgc 
tcgttgtact cgtgcagcgc catagcctgg 
ggcattccct ggaaggaatc cactacccag 
agaaccgcgc gcatgaagat gcatgcgccg 
ccgggaacac cgtctgcgag cacggcttcc 
agaccgatca tcgtgtgcgc gacagttggc 
ctgtagtcgg tgttgtcctg ccaggcgttc 
gtgagtacct tcttgagcag gtccaggtac 
gatccagctg ttctcgggtg actagttcat 
gtccgcacca gcgcggcggg atggggcatg 
gcagcagcga agccggtgtc gccgagcacc 
tcggactggg cctgctcatc cggcaagcag 
gcgtagtcga actggtcgaa gtactgggga 
gcggtgaatg ccgttcccga gccgcccgcg 
aggttgagcg ggaccgattc ggcgatccgg 
cggtgctcgg ggggaacggc gatcacggcc 
actgcgcgca gcagcggagc cggcccggtg 
cagacccgcc gtgctgaggt gcgcgccgcg 
tacggcacgt actggaccgg tgcgccttgc 
cagggatcga ggatgagctc gggagtgggc 
gggtcaagca actcgtgggc tcgatcgctg 
cggtgcagca cgaccggcag gtcgagcaat 
acgtcgacca acagcactga cggtcgccag 
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gcactgtcgc gggcggccgt ggcaggagct 40320 
ttcggagcca aaacctggtg acgatcccga 40380 
accacaagtc gtgaccggcg cccgtggagt 40440 
cggcctggtc gaccacgacg acctcgacgc 40500 
ggcgcgacag cgggccgtat cccccgccga 40560 
atgcgccggt cgggatcgtc acgccccagt 40620 
gggcggcgcc cgcctcgatc gcgaatgccc 40680 
gctcggagag gtcgacgagc actcggatcg 40740 
gcccgccgct gcggacccct acccgcctgc 40800 
cgacgtcttc ggcggagctg gcgatgtgga 40860 
tgtgcccgac gacgaggtcc ggataacgag 40920 
ctggggttcg acgattcatg ggtgccgggt 40980 
tgactggtcc actttgttcg cccgcagccg 41040 
tgtgtttcag ctcatacgtg aaacccggtc 41100 
atcgcggtgc gcatggtgcc gtatgcgctg 41160 
gagatcgcag ttgccccctg gtcggtggtg 41220 
ccggcgaagc tccaggaagc agtgcaggtg 41280 
gtcgacgaac tcgtcgtacg ctgcgatccg 41340 
ctggagatca cttctggcgc tcagctggtc 41400 
ctgcggctcg cggcggtcgc cgaagcagct 41460 
ctggaagggc tcttcggccc ggttgacggc 41520 
tcggacagca tgacgcagtt ctcgcccgac 41580 
gcgttccgga acagggtgtc caccgcggtg 41640 
cgcgcggacc tcggtgcgct ggcagtccgc 41700 
tggtacaccg aacactacga gcaccacttc 41760 
ttggaaatag gaatcggtgg ttatcacgca 41820 
tggcagcggt acttccggcg aggtctcgtt 41880 
aacgaagggc accgagtgcg aaagctgcga 41940 
gacatggcgg ggaagatcgg gccgttcgac 42000 
gaccacgtca agaaatcctt ccaatccctg 42060 
gtcatcgagg atctccagac ggcgtactgg 42120 
gcggcccagc gcacctcgat cgacatgctc 42180 
gagcgcgaat cgcggtgcgg gaccgagccc 42240 
cacttctacc acaacctggt attcgtggag 42300 
gggttcgtgc cccggcaagc gctcggcgtc 42360 
cgccagtagg cgcccgtgcc gtcgatgtcg 42420 
cggaacccct tcaccgcgtc ctggcaggac 42480 
ccgcccggcg agagcttcgg gtacaggttc 42540 
tcgccgtcga gtcgtagcac ggcgagttcc 42600 
aaccagccgg ggaggaacct gacctgttcg 42660 
cggacggtct caagcgatac gccaagcacg 42720 
tccgcttggt ggtcttgcgc agagctttcc 42780 
acggtacgtc cggtatctcc gaatgcctgg 42840 
ccccgccaga caccggtctc ggcgaaatcc 42900 
acgcagtgct ggaggttgtc cagccgctcc 42960 
cagtccgtgc ctttggcccg agcggcctgc 43020 
ggatgcggcc gatcactgta aatcgtgttg 43080 
agcgcgttct gggagggcat cggttctccg 43140 
caggcacgga tggccgcagt gttctccagc 43200 
gccgtgatct cgtcgctgag tttgattgcc 43260 
gttgcgattg agtcggtgaa ctgttcgtgg 43320 
atgcccgccc cggcagcggc gaggttgcgc 43380 
agcacgagtt gcgggatgcc gagtcgggtc 43440 
cagatgacca gctcgcaggt acgcaggaac 43500 
gcgttgtcgg gtaggtcggt gagaagtgcc 43560 
tcgacgccgg gcaactcggt ggcagccgct 43620 
gcgttcagca ccatgcggcc catgcagatg 43680 
ccccatgccg ggaatgcgcc gcttccgttg 43740 
ggcgcgtcgc ttgcttgcag gctcggcgga 43800 
aggccggtca gtccgtggtg ccggcacacc 43860 
aaggggcctg cggtggggtc gactccccag 43920 
ccgccgagca cccggccgat cagcgcgcag 43980 
gcctcggcca gtcgaaggta ttcggggagc 44040 
Page 12 



tgatcgagcg agctttgcgc gacattggac 
tcggtgtcgc gctgaccgaa cgccggattg 
tcgccggtcc tgtcgttccc gcggatcccg 
gcctgcagct ctggtggtgc ggcgatcagg 
cacagcggca ccattgccat gagatgcgtc 
atacttcgga ccccagtctc tttcccccga 
aggatttgga aaatgcgctg cgtatgtcga 
gcaacaatag tgtgtcacgg caggaatgtc 
cacctggttt cacgcgaacg agtgaaatgc 
cctgtacggt gatcaccgtt ggttctgcgg 
ttgcctcctg gccggatagt tatagtcggt 
actgacggct agtttgccgt cttttctctc 
acgagaaggt gaacgttgca gagatcaggc 
tcgcagcagg ttgggcagat gtatgacctg 
ggcccctgcg ccatccacca cggctactgg 
gccgccgacc ggctcaccga ccttgtcgcc 
ctgctcgatg tggggtgcgg taccggacaa 
atccagatca ccggcatcac cgtcagccag 
cgcgaacgcg gactaagcca ccgggtggac 
tacccggaca atgctttcga cgccgcctgg 
ccggaccgtg ccatccggga aatccttcga 
accgaggtcg tcaaacgaga agcgggcggc 
accggccttc ggatctgcct ggctgagcaa 
gagatcctcg attgggagga cgtgtcgtcg 
gaagagctcg ctgcgcacca gcacgggatc 
tgggccgccg cggtctgcga ttatgagaaa 
acggcgcgga agccggtcgg ctgagggcgc 
tcggggaatc gccggtggcg gcgccagcag 
atcggagggg cagtaccgaa tgagcgaagc 
ctgccgccta ccccaggcgc ctgacccggc 
ggacgccatc accacggtcc cggaagggcg 
gcccaagggc ccggaatggg gtggtttcct 
cttcgggatc tcgccgcgag aagcggcagc 
gctcgcctgg gaggcactcg aagacgccgg 
cgccggtgtg ttcatggggg cgatctctga 
gccggaagtg gctgcgcagt accgcctcac 
cgtgtcctat gtgctcggcc tgcgcgggcc 
gtccctggtc ggcgtgcatc tcgccagcga 
actcgccggc ggcgtgaacc tcaacctggc 
cggcgcgctc tccccggacg gtcgctgctt 
ccgtggtgag ggcggcggcc ttgtcgtgct 
cgaccggatc tactgcctca tccgcggcag 
gctcaccgtt ccggcggcgg acgcccaggc 
gggcgtcgac ccggccgccg tgcagtatgt 
ggatcccgtc gaagcagcag ccctcggagc 
cgagctgcgt gtggggtcgg cgaagaccaa 
caccgggttg ctgaagaccg cactcagcat 
tttcaecgcc cccaacccgg aaatcccgct 
tctgcggccg tggccggaga gcgaggggcc 
gggaggcacg aactgccacc tggtgctctc 
tggacccgct gaggcgacca tgccgtgggt 
tgcgcaggcg gcgcgcttgc acacgcacct 
cgcctactca ctggcgacca ctcgatccgc 
cgacgaaccg aaactgctcg ccgggttgaa 
gctgtgccac ggcgcgactt ccggcgagcg 
cagccagtgg atcgggatgg gtaggcagct 
gatgtcggac tgcgccgacg cattggcgcc 
gcgcaacgcg gccggcgctg cgcaccttga 
cgccatcatg gtctcgctcg cggagctctg 
cgtcgggcac tcgcaggggg agatcgcggc 
cgatgccgcc agggtggtgg cggtgcgcag 
cgcgatggcc tcgttgcagc atcccgccga 
cgatcggatc ggcgtggcgg gggtgaacgg 
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gcggtctgct cccacagttg ccggcctgcc 44100 
ggaaagcgca gctgcgtggt tccacccgta 44160 
gccgtggtga gacctgcacc atgcgcggtc 44220 
acctcgtgcc cggatgcttg cagcgcccag 44280 
ggatagggca agggaacgac gagtacgcgc 44340 
ttagcgcagc agcccctact cccattggcc 44400 
tcgccgttga cgtccaacgg acttccggcg 44460 
acgcgaccat cgaagatctt tgggtcgccg 44520 
gcgagctccg ctcgatcggg gtgggccgga 44580 
ggattcatgg ggaagatttg cgctggctgt 44640 
accgccgcat gcggcggtaa ccgcgaatta 44700 
tgtgtgtttc ctgctcggtt ccagaaaatt 44760 
ataccggtgt tgccaggtgg cgcaccaaca 44820 
gtcacgccgt tgctgaactc ggtcgcgggc 44880 
gagaacgacg ggcgggcttc ctggcagcag 44940 
gaacggaccg tgctcgatgg cggcgttcga 45000 
ccagcgctgc gcgtcgcgcg cgacaacgcg 45060 
gtgcaagtgg ccatcgccgc tgattgcgca 45120 
ttctcgtgcg tcgatgccat gtccctgccg 45180 
gccatgcagt cgctgttgga gatgtccgaa 45240 
gtactcaaac ccggtggcat cctcggcgtc 45300 
gggatgccgg tgtccgggga caggtggccg 45360 
cttctggaat cgctgcgtgc agcggggttc 45420 
aggacccggt acttcatgcc gcagttcgcc 45480 
gcggacaggt acgggccggc tgtcgccggc 45540 
tatgcccacg acatgggcta tgcgattctg 45600 
gccgcaattc gatgacgttc atgcgccgtg 45660 
aggctgaact tactggtggt gtgtccagga 45720 
cgggaacctg atagccgtca tcggactgtc 45780 
ttccttctgg cggttgctgc gcaccggaac 45840 
gtggggcgac ccgttgcctg gtcgggatgc 45900 
ggctgatgtc gactgcttcg atcccgagtt 45960 
cgtggatccc cagcagaggc tggctctgga 46020 
tatccccgcc ggcgagctgc gcggtactgc 46080 
cgactacgcc gccctgctgc gcgagagccc 46140 
cggcacccat cgaagcctga tcgccaaccg 46200 
aagcctgacg gtggattcag gtcagtcctc 46260 
gagcctgcga cggggtgagt gcacgatcgc 46320 
cgccgagagc aacagcgctc tgatggactt 46380 
caccttcgat gtgcgggcga acggttacgt 46440 
gaagaaggcc gatcaggcgc acgccgatgg 46500 
cgcggtcaac aacgatgggg gcggtgccgg 46560 
ggagctgctg cgccaggcat accggaacgc 46620 
cgagctccac ggcagcgcga ccagggtcgg 46680 
tgtcctgggg gcggcgagac ggcccggcga 46740 
cgtcggccat ctggaagcag cggcgggcgt 46800 
ctggcaccgc gaactgccgc cgagtcttca 46860 
ggacgaattg aacctacgcg tccagcgtga 46920 
gctgctggcc ggcgtcagcg ccttcggaat 46980 
cggcacgtcc cgggtggagc gacggcgcag 47040 
cttgtcggcc agaacaccgg tcgcattgcg 47100 
caatacggcc ggtcaaagtc cgttggacgt 47160 
gctgccgcac cgggccgcgc tggtcgcgga 47220 
ggccctcgct gacggcgacg acgcgcccac 47280 
ggcagcggtc ttcgtctttc ccggacaggg 47340 
gctcgaaacc tccgaggttt tcgcggcgtc 47400 
gcacctggat tggtccctgc tggatgtgct 47460 
ccacgacgat gtcgtccagc ccgcgctgtt 47520 
gcgttcgtgg ggcgtgcgtc cggtggcggt 47580 
ggcctgcgtc gccggggccc tgtccgtccg 47640 
caggcttctg acggcgctgg ccggcagtgg 47700 
agaggtgcgg caaatcctgt tgccctggcg 47760 
accgtcgtcg accctggtgt caggggaccg 47820 
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ggaggcgatg gcggaactgc tggccgagtg cgcagaccga gagctccgga tgcgccggat 47880 
tcccgttgaa tacgcctccc attcgcctca catcgaggtt gtccgggatg agctgctggg 47940 
gctgttggcg ccggtcgaac ccaggacggg aagcatcccg atctattcga cgacgaccgg 48000 
ggacctgctg gaccggccga tggacgccga ctactggtac cgcaaccttc gtcaaccggt 48060 
gctgttcgaa gcggccgtcg aggccctgtt gaagcggggg tacgacgcat tcatcgagat 48120 
cagcccacac ccggtgctga ctgcgaacat ccaggaaacc gccgtgcgag cagggcggga 48180 
ggtagtggcg ctcgggacac tccgccgcgg cgaaggtggc atgcggcagg cgctgacgtc 48240 
gctggccaga gcacacgtac acggagtggc cgcggactgg cacgcggtct tcgccggtac 48300 
cggggcgcag cgggtcgacc tgccgacgta cgcctttcag cgacagcgct actggctgga 48360 
cgcgaagctt cccgacgtcg ccatgcccga gagcgacgtg tcgacggcgt tgcgggaaaa 48420 
gctgcggtct tcgccgaggg cggacgtgga ctcgacgacc ctcacgatga tccgggcaca 48480 
ggcagccgtg gtcctcggcc actccgatcc gaaagaggtg gacccggatc ggacgttcaa 48540 
ggacctgggc ttcgattcct cgatggtggt cgagctgtgc gaccgcctaa acgccgccac 48600 
aggtctgcga ctcgcaccga gcgtcgtttt cgactgtcct acgccggaca agctcgcccg 48660 
ccaggtacgg acgttgttgt tgggcgagcc ggctcccatg acgtcacacc ggccggactc 48720 
cgatgcggac gagcctatcg ccgtgatcgg gatgggctgt cggtttccgg gtggggtgtc 48780 
ctcgcccgag gagttgtggc agttggtcgc cgctgggcgg gacgtcgtgt ccgagttccc 48840 
ggctgaccga ggttgggacc tggagcgtgc ggggacatcg cacgtgcgcg ccggcgggtt 48900 
cttgcatggc gccccggatt ttgaccccgg gttcttccgg atttcgccgc gcgaggcgtt 48960 
ggcgatggat ccacagcagc ggttgctgct ggaaatcgcc tgggaagcag tcgaacgagg 49020 
cgggatcaac ccgcagcacc tgcacggaag tcaaaccggg gtcttcgtcg gcgcgacctc 49080 
cctggactac gggccacgcc tgcacgaagc gtccgaggag gcggccgggt acgtgctcac 49140 
cggcagcacc acgagtgtgg cgtcgggtcg ggttgcgtat tcgttcgggt tcgagggccc 49200 
tgcggtgacg gtggatacgg cgtgttcgtc gtegttggtg gccctgcatt tggcgtgtca 49260 
gtcgttgcgt tcgggtgagt gtgatctggc gttggccggt ggtgtgaccg tgatggccac 49320 
gccggggatg ttcgtggagt tttcgcggca gcgtggtttg gcgccggatg ggcggtgcaa 49380 
gtcgttcgcg gaggccgccg acggcaccgg ctggtccgag ggtgctggcc tggttctact 49440 
ggagcggttg tcggatgccc ggcggaatgg gcatgaggtg ctggcggttg ttcgtggtag 49500 
tgcggtgaat caggacggtg cgtcgaatgg tttgaccgcg ccgaatggtt cgtcgcagca 49560 
gcgggtgatt gcccaggcat tggcgagtgc ggggttgtcg gtgtccgatg tggatgctgt 49620 
ggaggcgcat gggacgggca cgcggcttgg tgatccgatc gaggcgcagg cgctgatcgc 49680 
cacctacggc cagggccggc ttccggaacg gccattgtgg ttgggctcga tgaagtcgaa 49740 
catcggtcac gcgcaggcag ctgcggggat agccggcgtc atgaagatgg tgatggcgat 49800 
gcggcacggg cagctaccgc gcacgttgca cgtggatgag ccgacttctg gggtggattg 49860 
gtcggcgggg acggttcaac tccttacgga gaacacgccc tggcccggga gtggtcgtgt 49920 
tcgtcgggtg ggggtgtcgt cgttcgggat cagtggtact aacgcgcacg tcatcctcga 49980 
acagcccccg ggagtgccga 50000 



<210> 2 
<211> 50000 
<212> DNA 

<213> saccharopolyspora spinosa 
<400> 2 

gtcagtctgc ggggccgggt tcgggctctg tcgtggatgt tccggtggtg ccgtggatgg 60 

tgtcgggcaa aacacccgaa gcgctatccg cgcaggcaac ggcgttgatg acctatctgg 120 

acgagcgacc tgatgtctcc tcgctggatg ttgggtactc gctggcgttg acacggtcgg 180 

cgctggatga gcgagcggtg gtgctggggt cggaccgtga aacgttgttg tgcggtgtga 240 

aagcgctgtc tgccggtcat gaggcttctg ggttggtgac cggatctgtg ggggctgggg 300 

gccgcatcgg gtttgtgttt tccggtcagg gtggtcagtg gctggggatg ggccgggggc 360 

tttaccgggc ttttccggtg ttcgctgctg cctttgacga agcttgtgcc gagctggatg 420 

cgcatctggg ccaggaaatc ggggttcggg aggtggtgtc cggttcggat gcgcagttgc 480 

tggatcggac gttgtgggcg cagtcgggtt tgttcgcgtt gcaggtgggc ttgctgaagt 540 

tgctggattc gtggggggtt cggccgagtg tggtgttggg gcattcggtg ggcgagttgg 600 

cggcggcgtt cgcggcgggt gtggtgtcgt tgtcgggtgc ggctcggttg gtggcgggtc 660 

gtgcccggtt gatgcaggcg ttgccgtctg gcggtgggat gctggcggtg cctgctggtg 720 

aggagctgtt gtggtcgttg ttggccgatc agggtgatcg tgtggggatc gccgcggtca 780 

acgctgcggg gtcggtggtg ctctctggtg atcgggatgt gctcgatgac cttgccggtc 840 

ggctggacgg gcaagggatc cggtcgaggt ggttgcgggt gtcgcatgcg tttcattcgt 900 

atcggatgga tccgatgctg gcggagttcg ccgaattggc acgaaccgtg gattaccggc 960 

gttgtgaagt gccgatcgtg tcgaccttga ccggagacct cgatgacgct ggcaggatga 1020 

gcgggcccga ctactgggtg cgtcaggtgc gagagccggt ccgcttcgcc gacggtgtcc 1080 
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aggcgctggt cgagcacgat gtggccaccg 
cggcgctgat ccaggaatgt gtcgccgcat 
cggcgatgcg caggaaccag gacgaggcgc 
acgtacgtgg tggtgcggtg gactggcggt 
tcgagctgcc cacctacgcc ttccaacgac 
cttccgccgg cgacatgggc aggcgtgtcg 
aagatgtgga atcgcttgca cgcgtattgg 
ccctgagaag cgcccttccg gtgttggccg 
ttatggatca gcggtgttac cgaattggct 
gaactgtttt cggtacctgg ctggtcttcg 
tggtggactg cgttacggca ctgcgggcac 
atcccgaccc gacctccttc ggcgaccggg 
ttgttggcgt gttgtcaatg ttgtgcttgg 
tgtcacgggg ttttgcgttg accgtggagt 
ctgcccggtt gtggttgctg acgtgtggtg 
cagcgcaggc cctggcgtgg gggttggggc 
ggggcggctt gatcgatatt ccggtcttgt 
ttgtgctggc aggtctcgat gaggacgagg 
gtcggttggt acgccacact gtctcagctg 
cggtgctggt gacgggcggc acgggtggtt 
acgccggagc cgaacatgtg gcgatggtga 
agaagttgcg gacggaactg gaggatctgg 
tgaccgatcg cgaggcgctc gccgaagtgc 
ccgcggtagt gcatgcggca ggcgtgatcg 
ctgatttcga tcacgtgttg tccgcaaagg 
tggccgatgt ggaattggac gcgttcgtct 
ctgggggaca cggggcttac gcagcggcga 
gtcggtcgcg agggctggtc gcgactgcgg 
tggcctccgg agaaacagga gaccagctgc 
agcacgccat cgccggaatc cggcaggccg 
ccgatgtcga ttgggcacgt ttcagcgcgg 
tgaacgaact ggccgaggtc aaggaactcc 
ttgccgacgc gtcgttggaa tggcggcagc 
aacagctgat cctggagctg gtacgcggcg 
cagcggccgt tgcatcggaa cgagccttca 
tcgaactccg cgttcggctc aatcgagcta 
tcagccatcc cacgcctgcg gaactggctg 
ccgcaggagc aggcattccc gaggaggacg 
tcccgttccc ggccttgcgc gaggcaggct 
acccggtcga ctccggtatc tcctcggacg 
atgtagccgg cctcgtcgaa gcagcgctgg 
ggagatgacg gtgaccacca gttacgaaga 
ggagaacgaa cgcctccggc gcggcaggga 
cgcgatcgtg gcgatgagtt gtcgttatcc 
gcaactggct gccggcggtg tggacgcgat 
cctggatggc gtgttcgttc cggactccga 
cggttttctt cagggcgtgt cggagttcga 
ggcgctggcg atggatccgc agcagcggtt 
gcgggctggg ctggagcagc ggtcgacacg 
caatggccag gactacgcgt cgtggttgcg 
gctgacgggc ggtgcggcag cggttctttc 
gggtcctgcg gtgacggtgg atacggcgtg 
ggggcaagca ctgcgggccg gtgagtgcga 
gtcgacgccg aaggtgttcc tggagttctc 
gtgcaagtcg ttcgcggcgg gtgcggatgg 
gttgctggag cggttgtcgg atgcccggcg 
tggtagtgcg gtgaatcagg acggtgcgtc 
gcagcagcgg gtgattaccc aggcgttggc 
tgctgtggag gcgcatggga cgggcacgcg 
gatcgccacc tacggccgtg atcgtgatcc 
gtcgaacatc ggtcatacgc aagcggcggc 
ggcgatgcgg cacgggcagc tgccacgcac 
ggattggtcg gcggggacgg ttcaactcct 
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ttgtcgagct cggtccggac ggggcgttgt 1140 
ccgatcacgc cgggcggctg agcgcggtcc 1200 
agaaggtgat gacggccctg gcacacgtcc 1260 
cgttcttcgc cggtacaagg gcgaagcaaa 1320 
agcggtactg gctgaacgcg ctgcgtgaat 1380 
aagcgaagtt ctggggcgcc gtcgagcacg 1440 
gcattgtgga cgacggcgct gctgtggatt 1500 
gttggcagcg aacccgcacc accgagtcca 1560 
ggcggcaggt agccggactc ccgccgatgg 1620 
cgcctcatgg ctggtccagc gaaccggagg 1680 
gtggtgcctc ggtggtgttg gtggaagctg 1740 
tacgaaccct gtgttcgggc cttccggatc 1800 
aagaatcggt ccttccggga ttttctgcgg 1860 
tggtgcgggt tttgcgggca gctggtgcga 1920 
gcgtgtcggt gggagatgta ccggttcgtc 1980 
gtgttgtggg gttggagcat ccggactggt 2040 
tcgacgaaga cgctcaagag cggttgtcga 2100 
tcgcgatccg tcctgacggc atgttcgcgc 2160 
atgtgaagaa ggcgtggcgc cccaggggat 2220 
tgggggcgca cgttgctcgc tggctggccg 2280 
gtcgacgcgg cgagcaggca ccgagtgctg 2340 
gtacccgggt gtcgatcgtg tcatgcgatg 2400 
tgaaagccct tccggctgaa aacccgttga 2460 
agactggtga tgcggcggca atgagcctgg 2520 
tggccggtgc cgcgaatctg gatgccttgt 2580 
tgttctcatc ggtgtcagga gtttggggcg 2640 
atgcctatct ggatgcgctc gcggaacagc 2700 
tggcctgggg gccgtgggcc ggcgagggca 2760 
gccgatacgg cctttcccca atggctccgc 2820 
tggaacagga cgaaatttcc ctggtagtgg 2880 
gattgctggc ggctaggccg cggccgctgc 2940 
tcgtcgatgc ccagcccgag gcgggagtcc 3000 
gattgtccgc ggcaccgagg ccgacacagg 3060 
aaaccgctct ggtgctggga caccccgggg 3120 
aggacagcgg attcgactcg caggccgcgg 3180 
ccggcctcca gttgccatcg acaattatct 3240 
cggagctgcg ggcgaggctt cttcccgagt 3300 
aggcgcgaat cagagcggca ctgacgtcga 3360 
tggtgagtcc gctgctcgca cttgccggac 3420 
atgcggccgc gacctcgatc gatgcgatgg 3480 
gcgaacgcga gtcctgagac cgccgacctg 3540 
agttgtcgag gcactgcgag catcgctcaa 3600 
tcggttctcc gcggagaagg acgatcccat 3660 
cggtcaggtc tcctcgccgg aggacctgtg 3720 
ctccgaagtt ccgggggatc gcggatggga 3780 
tcgtcctggc acgtcgtatg cctgcgcggg 3840 
cgcgggtttc ttcgggattt cgccgcgtga 3900 
gctgctggaa gtcgcgtggg aggtcttcga 3960 
cggttcccgc gttggcgtgt tcgtcggcac 4020 
gacgccgccg cctgcggtgg caggtcatgt 4080 
gggccgggtt gcgtattcgt tcgggttcga 4140 
ttcgtcgtcg ttggtggcgt tgcacctggc 4200 
ccttgccctt gccggtggcg tcacggtgat 4260 
ccgccaacgg ggtctcgcgc cggatgggcg 4320 
cactggatgg ggtgagggtg ccggactgtt 4380 
gaatgggcat gaggtgctgg cggttgttcg 4440 
gaatggtttg accgcgccga atggttcgtc 4500 
gagtgcgggg ttgtcggtgt ccgatgtgga 4560 
gcttggtgat ccgatcgagg cgcaggcgct 4620 
tggccggccg ttgtggttgg ggtcggtcaa 4680 
gggtgtggct ggtgtgatca agatggtgat 4740 
gttgcacgtg gaatcgccgt cgccggaggt 4800 
tacggagaac acgccctggc ccaggagtgg 4860 
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tcgtgttcgt cgggtggggg tgtcgtcgtt 
cctcgaacag cccccgggag tgccgagtca 
ggatgttccg gtggtgccgt ggatggtgtc 
ggcaacggcg ttgatgacct atctggacga 
gtactcgctg gcgttgacac ggtcggcgct 
ccgtgaaacg ttgttgtgcg gtgtgaaagc 
ggtgaccgga tctgtggggg ctgggggccg 
tcagtggctg gggatgggcc gggggcttta 
tgacgaagct tgtgccgagc tggatgcaca 
ggtgtccggt tcggatgcgc agttgctgga 
cgcgttgcag gtgggcttgc tgaagttgct 
gttggggcat tcggtgggcg agttggcggc 
gggtgcggct cggttggtgg cgggtcgtgc 
tgggatgctg gcggtgcctg ctggtgagga 
tgatcgtgtg gggatcgccg cggtcaacgc 
ggatgtgctc gatgaccttg ccggtcggct 
gcgggtgtcg catgcgtttc attcgtatcg 
attggcacga accgtggatt accggcgttg 
agacctcgat gacgctggca ggatgagcgg 
gccggtccgc ttcgccgacg gtgtccaggc 
cgagctcggt ccggacgggg cgttgtcggc 
tcacgccggg cggctgagcg cggtcccggc 
ggtgatgacg gccctggcac acgtccacgt 
cttcgccggt acgggagcga aacaaatcga 
gtactggctg gtgccatcgg attccggtga 
gcatccgctg ttgggtgctg tggtgccggt 
caggatttcg gtgcggacgc atccgtggct 
cgttgcgggc accgcgttgc tggagatcgc 
acgggtggaa gagctcaccc tggaagcacc 
ggttcagctg cgagtgggcg cgcccgagaa 
acgccccgaa ggggcggcgg agcatgactg 
aggccgcggc gaggcggctg gagacctggc 
cgacctcgac gaattctatc gggacctcgc 
ccaagggctc aaggcggcct ggcggcaagg 
gggaacggaa gattctggtt tcggggtgca 
aacggctgtc cgagacatgg atgacgcacg 
gcacgccaag gccgcgccgg ctttgcgggt 
gtccctgctg gtttgtgatg gcaccggtcg 
gcggtcggct gcggcccggc ggaccggtgc 
ccggttgagc tggccaacgg ttcaactgcc 
tctcggcacc tcagaagtgt ccgctgacat 
ggctgcgttg gatgccggtg ccgaaccacc 
cggtggacga acagcggatg tccgggagac 
ctggctttcc gatcagcgac tcaacgaatc 
ggccgtggag ccgggcgaac ccgtgaccga 
gcggtcgacg cagaccgaac accctgatcg 
gcaactcctc cccgcgctgc cgggggtgct 
acgtggcggc gctcatgcgc ccagactggc 
gccggacggc accgggtggc gattggaggc 
attggtggac gaaccgacgg ccacggcacc 
gcgcgcggcc ggggtgaact tccgggatgc 
ggcatcgctg ggcagtgagg gcgccggggt 
cctggcaccc ggcgaccgcg tgatgggaat 
cgccgaccat cgcatggtga cgaggattcc 
ggtgccgatc gtctttctca ccgcctacta 
aggggagtcg ttgctggttc attcggccgc 
cgccaggcac ctcggtgcag aggtgtacgc 
ggagctgagc cgagaacacc tcgcttcgtc 
cggggcaacc ggcggacgcg gcgtcgacgt 
cgatgcgtct ctgcgaatgc tgccgcgcgg 
tgttcgtgac cccgtcgagg tcgccgatgc 
taccgtagag gcaggcccgc agcgaatcgg 
cgagggacgc gtgctggagc ccctgcctgt 
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cgggatcagt ggtactaacg cgcacgtcat 4920 
gtctgcgggg ccgggttcgg gttctgtcgt 4980 
gggcaaaaca cccgaagcgc tatccgcgca 5040 
gcgacctgat gtctcctcgc tggatgttgg 5100 
ggatgagcga gcggtggtgc tggggtcgga 5160 
gctgtctgcc ggtcatgagg cttctgggtt 5220 
catcgggttt gtgttttccg gtcagggtgg 5280 
ccgggctttt ccggtgttcg ctgctgcctt 5340 
tctgggccag gaaatcgggg ttcgggaggt 5400 
tcggacgttg tgggcgcagt cgggtttgtt 5460 
ggattcgtgg ggggttcggc cgagtgtggt 5520 
ggcgttcgcg gcgggtgtgg tgtcgttgtc 5580 
ccggttgatg caggcgttgc cgtctggcgg 5640 
gctgttgtgg tcgttgttgg ccgatcaggg 5700 
tgcggggtcg gtggtgctct ctggtgatcg 5760 
ggacgggcaa gggatccggt cgaggtggtt 5820 
gatggatccg atgctggcgg agttcgccga 5880 
tgaagtgccg atcgtgtcga ccttgaccgg 5940 
gcccgactac tgggtgcgtc aggtgcgaga 6000 
gctggtcgag cacgatgtgg ccactgttgt 6060 
gctgatccag gaatgtgtcg ccgcatccga 6120 
gatgcgcagg aaccaggacg aggcgcagaa 6180 
acgtggtggt gcggtggact ggcggtcgtt 6240 
gctgcccacc tacgccttcc aacgacagcg 6300 
tgtgacaggt gccggtctgg ccggggcgga 6360 
cgcgggtggt gacgaggtgt tgctgaccgg 6420 
ggccgaacac cgggtgctgg gtgaagtgat 6480 
cttgcacgcg ggggaacgtc ttggttgtga 6540 
gctggtcctg ccggagcgcg gggcgatcca 6600 
ttccggacgc aggccgatgg cgctgtattc 6660 
gacgcggcac gccacgggcc ggttggcgcc 6720 
cgactggccg gctcctggcg cgctgccggt 6780 
agagcttggg ctggagtacg gcccgatctt 6840 
ggacgaggtg tacgccgaag ccgcgctgcc 6900 
tccggcactg ctggacgcgg ctctgcacgc 6960 
cttgccgttc cagtgggaag gtgtgtccct 7020 
ccgcgtggtc ccggctggtg acgatgccaa 7080 
accggtgatc tcggtggacc gactcgtatt 7140 
gcgccgacag gcccatcaag ctcggttgta 7200 
gacatccgct cagccaccgt cctgcgtgct 7260 
acaggtgtat ccggacctcc ggtcgttgac 7320 
cggcgtcgtc atcgcaccca cgccccccgg 7380 
gactcggcat gcactcgacc tggtacaagg 7440 
ccgattgctc ctggtgacac agggagcagt 7500 
tctggcgcag gccgcgctct ggggactgct 7560 
cttcgtcctc gtcgatgtgc ctgagcccgc 7620 
ggcctgcggc gaacctcagc tcgcgttgcg 7680 
tggactgggc agcgatgacg tcctgcccgt 7740 
cacgcgcccg ggaagcctgg atgggttggc 7800 
gctgggtgac ggtgaggtca ggattgcgat 7860 
gctcatcgcg ctcggtatgt atcccggtgt 7920 
cgtggtggag accggccccg gcgtcaccgg 7980 
gatcccgaag gcgttcgggc cgctcgcggt 8040 
cgctggttgg agcttcgcgc gggccgcatc 8100 
cgcgctggtt gatctcgccg ggttgagacc 8160 
cggtggggtg gggatggccg cgatccaact 8220 
caccgctagc gaggacaagt ggcaagccgt 8280 
gcggacgtgc gatttcgagc agcagttcct 8340 
cgtgctcaac tccctcgccg gggagttcgc 8400 
tggccgtttc ctggagttgg ggaagacgga 8460 
gcatccgggc gtgtcttacc aggctttcga 8520 
cgagatgctt cacgagctgg tggagttgtt 8580 
cacggcttgg gacgttcggc aggcgcccga 8640 
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ggcgctacgg cacctgagcc aagcgcggca 
ggtgtgggac gccgcaggca cggttctggt 
ggtcgcccgg cacctcgtga tcgagcgcgg 
cggtcccgca gccagtggcg ctgctgagct 
ggtttccttg caggcttgcg atgtcgccga 
catcccggac gagcatccgt tgaccgccgt 
agtgtccgaa tcgctcaccg tggagcggct 
cgcgcggaat ctgctcgagc tgatcgaccc 
gtcgggtgtg ctcggcagcg gtgggcaggg 
cgcattggcg cagcaaaggc agtcgcgcgg 
ctgggcggaa catggcatgg ccagcacctt 
atctgggttg ctgccgatct cgaccgagga 
cggcgcgcat accgtggtgg cgccggttcg 
caagttctcc gtcctgcaag gtttggtcgg 
ggatgatgcc gagagcctcc ggaaacggtt 
gattctgctg gacctcgtcc gcatgcatgt 
ggagatcacc gcggacggca cgttcaaggt 
gttgcgcaac cggatcaacg gggcgacggg 
ctacccgacg ccggatgcgc tcgccgcgca 
ggccgggaca ttcgaggaac tcgacaggtg 
tgaggccacg cgggcgcaga tcaccacccg 
cgtgtccggc ggaaccggcg gcggctccgt 
cgagcttttc caactcctcg acaacgatct 
tcgcaacttc ctggagggaa acgccacatg 
cggcgtgcgc tcgtggatct gcaccaggcg 
gagcgggaac ccatcgcgat cgtggcgatg 
ccggaagggc tgtggaaact ggtcgcctcc 
gatcgtggtt ggcacctcga cgagctctac 
tacacccggc acggcggctt cctccacgac 
atcagccccc gtgaggcgct cgcgatggac 
tgggagaccg tcgaatccgc tgggatggac 
gtgttcgcgg gattgatgta cgagggctat 
gtcgaaggct atctcggaac cggcaatgcg 
gcgttcgggt tcgagggccc agcggtgacg 
gcgctgcatt tggcgtgtca gtcgttgcgg 
ggagtgacgg tgatgtcgac gccggagagg 
gcaccggatg ggcggtgtaa gtcgttcgcg 
ggtgccggtt tggtgttgct ggagcggctg 
ctggcggttg ttcgtggtag cgcggtgaat 
ccgaacgggc tggcccagga gcgggtcatt 
gcgtccgatg tggacgctgt ggaggcgcat 
gaggcgcagg ctctgatagc cgcctatgga 
ttggggtcgg tcaagtccaa catcggtcat 
atcaagatgg tcatggcgat gcggcacggg 
ccgaattcgc acgtggactg gtcggctggt 
tggccaggga cgggtacgcg ccgcgctgga 
gcacacgtca tcctcgaaca cgacccgctc 
tccccagcac ctgggatcgt gccctgggcg 
gcccaggccg aacggctgcg cgagctgtgc 
ggtttctcac tggccgccac gcgcacggct 
gacagcgcta cgttgcgctc cgggcttggc 
gtcgttgagg ggagcgtcct ggacggcgag 
cagtgggccg gtatggcagt cgacctgctg 
gacgagtgcg ccaccgcgct gcggaggtac 
ggagcggaga actccccacc gctggaccgg 
gtgatggtgt cgctcgccga ggtgtggcgt 
ggccacagtc aaggcgaaat cgccgcggcc 
gcggccaggc ttgtcgcatt gcgcagcaga 
atggcgtcgc tggcctgccc tgcggatgag 
cgtctggaag ttgcggcgat caacggcccg 
gcggtggacg aactgctggc agagtgcgct 
gtcgactacg cctcgcattc agcgcacgtg 
gccgccgggg tgcgacaccg ggacggccag 
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tgtgggaaag ctggtgctca ccatgcctcc 8700 
taccggcgga acgggagcac ttggcgcaga 8760 
ggtgcgaaac ctggtcctcg tcagcaggcg 8820 
cgtggcgcaa ctgacggcct acggtgccga 8880 
tcgtgagacc ttggcgaagg tgcttgccag 8940 
ggtgcacgcg gctggtgttc tcgacgacgg 9000 
ggaccaggtt ctgcgcccga aggtcgatgg 9060 
ggacgtggcc ctcgtgttgt tctcgtcggt 9120 
taactacgcg gcggccaact ccttcctcga 9180 
cctaccgacg agatcattgg cctgggggcc 9240 
gcgcgaagcc gagcaggatc gattggcgcg 9300 
ggggttgtcc cagttcgacg ccgcgtgcgg 9360 
attcagccgc ttgtccgacg ggaacgcgat 9420 
gccgcatcgc gtcaacaaag cggcgactgc 9480 
gggacgcttg ccggatgcag aacaacatcg 9540 
ggcggcagtg ctcggattcg ccggttctca 9600 
gctgggcttc gactcgttga ccgtggtcga 9660 
gctgcgactg cccgccaccc tggtgttcaa 9720 
cctcgtcacc gcgctgtccg cagaccgcct 9780 
ggcggcgaac ctgcccacgc tggccaggga 9840 
gctacaggcg atcttgcaga gcctggcgga 9900 
gccggaccgg ctcagatcgg ccacggacga 9960 
cgaacttccc tgatgcctca gccggagcct 10020 
tcgaatgaag agaagctccg ggagtacttg 10080 
cgcgagcggc tgcacgaggc ggagtcggga 10140 
ggctgccggt acccgggtgg ggtgcaggac 10200 
ggtggcgacg ccatcggtga attccccgct 10260 
gatcccgacc cggatcagcc cggaacctgc 10320 
gccggcgagt tcgacgcggg attcttcgac 10380 
ccgcagcagc ggctgctgct ggaaatctcc 10440 
ccgaggtcct tgcgggggag ccgcaccggg 10500 
gacaccggcg cccaccgggc aggagaaggt 10560 
ggaagcgtcg cctctggtcg ggttgcgtat 10620 
gtagacacgg cgtgctcgtc gtcgttggtg 10680 
cagggcgagt gtgatctggc gctggccggt 10740 
ttcgtggagt tctcccgtca gcgtggtctc 10800 
gcggctgcgg atggaaccgg ttggggtgag 10860 
tcagacgcca ggcggaacgg gcatcgggta 10920 
caggacggtg cgtcgaacgg attgacggcc 10980 
cagcaggtgc tcacgagtgc ggggctgtcg 11040 
ggaacgggta cgcggcttgg tgatccgatc 11100 
caggatcggg accgggaccg gccgctgtgg 11160 
acgcaggcgg ctgcgggcgt cgctggtgtg 11220 
gagctgccgc gcacgttgca cgtggacgag 11280 
gcggtccgac tcctgaccga gaacatccgc 11340 
gtgtcgtcgt tcggggtaag cggtaccaac 11400 
gccgtgaccg agaacgagga agcagcgcag 11460 
ttgtccgggc ggtcgtcgac ggcgctgcgg 11520 
gagcagaccg atcccgaccc cgtcgatgtc 11580 
tgggagcacc gagcggtggt gcttggtcgg 11640 
gttgttgcca gcggtgaacc agcggtcgat 11700 
gtcgtcttcg tcttccccgg tcagggctgg 11760 
gacgcttcgc cgacgttcgc gcgccacatg 11820 
gtggactggt cgttggtcga cgtgctgcgc 11880 
gtggacgtgc tccagcccgc gtccttcgcg 11940 
tcctacgggg tgaggccggc ggccgtcgtc 12000 
tgcgcagccg gggtgctgcc gctggaggat 12060 
gcgttgaagg gactttcggg gcggggtggc 12120 
gtcgcggcat tgttcgcggg atcgggcggc 12180 
cgatcggtcg tggtgtccgg cgatctggaa 12240 
gaaaaggaca tgcgtgcacg ccgtatcccc 12300 
gaggtggttc ggagcccggt gctggcggcc 12360 
gtgccgtggt ggtcgacggt gatcggcgac 12420 
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tgggtggatc cggccaggct ggacggcgag 
cggttcgaac acgccgtgca gggcctggtc 
agtgcgcatc cggtgctgac cacggcggtc 
gtggccgcgg taggtacctt gcgacgtgac 
ctggccgagg cgtacgtgcg cggcgccacc 
ggccgacggc tggacctgcc gacctacccg 
ggagctgcct ccgacgaggc tcgtgcggtc 
gccgtggcgc gccaagacct gaaaagcgtg 
ccgctgagcg caacacttcc agccctgtcc 
ttggcagacg gttggcggta ccgagtcgac 
agaacgcggg aaacctggct cctggtcgtt 
gaacggctga cggatgcgtt gaacacgcga 
ccggcggcga ccagtggcga actcgcaacc 
gtgaaggcaa tcctgtcgct caccgcgttg 
gtcccgagcg ggattgcctt gctgctgaac 
agaattcctc tgtggaccat cacgcgtggt 
ctgcgcccga tgcaggcgca agcatggggt 
gagcgctggg gtgggctgat cgacctgccg 
ctgggcgaag cgctcaccaa cggcttggcg 
gtgctggccc ggcgactggt acccgccccg 
ccccgaggga gcgcgctgat cacgggcgga 
tggttggccg aaatcggagc cgagcgaatc 
gcaggcgccg ccgagctgga agccgaactc 
gcttgcgacg tgaccgatcg tgccgagatg 
gcggtgttcc acgcggccgg agtcggtcgg 
ggcctggccg aaatatgcgc ggcgaaggtc 
gacagcaccg atctcgatgc cttcgtcctg 
ggcggtcagg gcgcttacgg cgcggcgaac 
cgagcacgcg gtctgccggc aacctcgatc 
gccgacggcg cggcgggcga acacctgcgg 
tcggccatcc tggctctgca ggaagtactt 
gatgtggact gggaccgatt cgttcccacg 
gacgaagtgc cggcggcgag aaaggcgatg 
tcgccgttcg cccgcaatct cgcggagctg 
gatctggtgt gcgcccaggt ggcaaccgtg 
cccgagcggg cgttccgcgc gctcgggttc 
cgtttgacca ccgccaccgg gttgcgcctg 
ccggccgcct tggccgctca cctgctcgag 
gtgaccgctg ccagcgcgcc cgcgagtgac 
cggtttccgg gtggcgcgca ctcgccggaa 
gaggtgatcg gcgagttccc ctccgaccgg 
gatgcttcca ggcctggaac gacgtatgcg 
gagttcgatg ccgacctgtt cggcatcagc 
cagcggttgg tgctcgaaat cgcctgggaa 
tccttgaagg gcagtggggt cggcacgtac 
gatgtgcggc agtttcccga ggaggcggag 
gtgctgtcgg gtcgggtcgc gtattcgttt 
acggcttgtt cgtcgtcgtt ggtggcgttg 
gagtgtgatc tggcgttggc cggtggtgtg 
gagttctccc gtcagcgcgg tttggcgccg 
gcggacggca ccggctgggg cgaaggcgcg 
gcccaccgga atgggcatcg ggtgttggcg 
ggcgcctcga acggactggc ggcgccgaac 
gcactcgcga atgcggctct ttcggcgtcc 
gggaccaggc tgggtgatcc gatcgaggcg 
cgggagcggg atcggccctt gtggctgggg 
gccgcggcgg gtgttgccgg tgtgatcaag 
cccgcctcgc tgcacgcgga tgagcccacg 
cggctcctcg ccgaacaggt accttggccg 
tcgtcgttcg ggatcagcgg caccaacgca 
ccagatagta cagcggagac ggacaaaaca 
gttccctggt tggtgtcggg aaagacgacg 
ttgtctcagg tcgagtcccg gccggagcag 
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tattggtatc ggaacctccg gcagccggtc 12480 
gagcggggat tcggcctgtt catcgaaatg 12540 
gaggaaaccg gtgcggagtc ggagaccgcc 12600 
tcgggcggcc tccggaggtt gttgcattcg 12660 
gtggactggg ccgtggcgtt cgggggcgcg 12720 
ttccagcgcc agcggtactg gctggacaag 12780 
tcggacccgg cggcgggctg gttctggcaa 12840 
tccgatgccc tcgatctcga cgccgacgca 12900 
gtctggcacc gtcaggaacg agaaagggtc 12960 
tgggtacggg tggccccgca gccggtccgg 13020 
cccccgggcg gcatcgagga agcgctggtc 13080 
gggatcagca ccctgcgcct cgacgtgcca 13140 
gaactccgcg ccgcagccga cggtgacccg 13200 
gacgagcgac cccaccccga atgcaaggac 13260 
ctggtcaagg cgctcggtga agccgacctc 13320 
gcggtcaagg caggccccgc agatcggctg 13380 
ctggggcgag tagccgcact cgaacacccc 13440 
gattcgctgg acggcgacgt cctcacgagg 13500 
gaagaccaac tggcgattcg ccagtcgggc 13560 
gcgaatcagc ccgctggacg taagtggcgc 13620 
ctcggcgcgg tgggcgcaca ggtggcgagg 13680 
gtgctcacca gtcgacgggg caaccaagca 13740 
cgggcccttg gagcgcaagt gtccatcgtg 13800 
tccgcactac tggccgagtt cgacgtcacc 13860 
ctgctgccgt tggcggagac cgaccagaac 13920 
cgcggcgctc aggtgctgga cgaactgtgc 13980 
ttctcctcgg gtgccggggt atggggcggg 14040 
gcattcttgg acacactcgc cgaacaacgc 14100 
tcctggggca gttgggccgg cggcggcatg 14160 
cgacgcggga tacgtccgat gccggcggcg 14220 
gaccaggatg agacgtgcgt gtcgatcgct 14280 
ttcgccgcga ctcgcgccac ccggttgttc 14340 
cccgcgaatg ggccggcaga accaggcggc 14400 
ccggaagccc aacgacgcca cgaactggtg 14460 
ctcgggcacg gcagtcgcga ggaagtccag 14520 
gactccctca tggcggtgga tctgcgcaat 14580 
ccgaccacaa ccgtcttcga ctacccgaat 14640 
gagctggtgg gtgatgtcgc gtcggctgcg 14700 
gaaccgatcg cgatcgtcgc gatgagctgc 14760 
gacctgtggc ggctggtcgc cgccggcacg 14820 
ggctgggatg cggaaggcct ttacgatccg 14880 
cggatggcgg gattcctcta cgacgccggt 14940 
ccacgtgagg cgttggcgat ggatccgcag 15000 
gccctcgaac gggccggaat cgatccgttg 15060 
atcggcgctg gaagccgtgg gtacgcgacg 15120 
ggctacctgc tgacgggtac ctcggccagt 15180 
ggtttcgagg gtcctgcggt gacggtggat 15240 
catctggcgt gccagtcgtt gcgttcgggc 15300 
accgtgatgt cgacgccgga gatgttcgtg 15360 
gatgggcggt gcaagtcgtt cgcggagagc 15420 
ggcctgttgt tgctggagcg gttgtcggac 15480 
gtggttcgtg ggtcagcggt gaatcaggac 15540 
ggtccgtcgc agcagcgggt gatcaaccag 15600 
gatgtggatg cggtggaggc acatggcacc 15660 
caggcattga tcgcaacgta tgggcaggcc 15720 
tcggtcaagt cgaacatcgg tcatacgcag 15780 
atggtgatgg ccatgcggca cgggcagctg 15840 
tcggaggtcg attggtcgtc gggggcggtc 15900 
gagtctgacc gtgttcgtcg ggtgggggtt 15960 
catgtgatcc tcgaacaagc tacgaatgcg 16020 
gaatccggat ctactgtcga tattccggtc 16080 
gattccctgc ggggacaagc cgaacgagtc 16140 
cgttcgctgg atgttgccta ctcgcttgct 16200 
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tctggccgag ccgcgctgga tgaacgcgct 
gttgctggac tggcggcgtt ggccgccggt 
cgtgcttctg ctcggttcgg gttcgtgttc 
ggcagagcgc tctactcgaa gtttccggtg 
gagttggagg cacatctggg ggaagaccgc 
gcgcagctgc tggatcagac gctgtgggcg 
ctcttggggc tgctgggttc gtggggcgtt 
ggggagttgg ccgccgcgtt tgcggctggc 
gtggccgcgc gcgcccggtt gatgcaagcc 
gctgctggtg aagaccttgt tcggccattg 
gccgcgctca atgcccccgg ttcggtggtg 
atcgtcggcc ggctgaccga gctccgagtc 
tttcattcgc accggatgga cccgatgttg 
gagttcggta agccaacgac accgcttgtg 
gcggaaatga gcacaccagg gtattgggtg 
gacggtgtcc aggccctggc agcgcagggc 
ggaacgctgg cggcactggt tcgggagtgt 
tcgtcgatcc cactgatgcg cagggagcgg 
gcgcatctcc acacccgtgg tggtgaggtg 
gctaggcagc tcgagttgcc aacgtatgcc 
agtgcgcggc cagcacgcga ccgcgcagac 
gcggttgacc aaggcgatct ggcaacgttg 
gacacatgcg catcgttgag cgatgtattg 
cgcaaccgtt cgctcgtcga ttcctgccgg 
gtgccggccc cgaagatttc cggtacctgg 
ggattggtca cggctttgac gagttcactg 
ggcctgtccg aagaggaccc gcaccgcgag 
acggatgccg gtcaactcgg tggcgtgctt 
ccgggattct cctgcttgcc aactggtttc 
cggaaggccg acgtcgaggc gcctttttgg 
gatgtacgcg tgtctccgga gcaggccctg 
gagcacccgg agttctgggg tggcttgatc 
ggtgcccggt tggcgggtgt gttggcggat 
ggtggtgtgt tcgtgcggcg gttggaacgc 
cgtcctcggg ggacggtgtt ggtgacgggt 
cggtggttgg ccggtgccgg ggctgagcac 
gctccgggcg ctggggaatt gcgggcggag 
gtgccctgcg acgtggctga tcgtgacgca 
gagtgtccgc tgactgcggt ggtacacgcc 
gagatgggtt tggcggattt tgcagcggtg 
ctggacgagt tgctggccga ctcggagctg 
ggggtgtggg gagccggcgg acagggtgcg 
ttggccgagc agcgtcgggc gaggggattg 
gccggtgacg gcatggccgc cggcgaaacc 
tcgatggaac cgagcgcggc gctgctggca 
tccctcgtcg tggccgatgt cgattgggca 
cgacgcccgc tgctggacac catcgacgag 
caagcgggca caggcaaacc cgttgagctg 
gaacgcgacg atgcggtatt ggatctggtg 
gacgatgcca cggccctggc gccatcgcgg 
atggcggtgg agctgcgcaa ccggctgaac 
acgattttcg actaccccaa tgccgagtcg 
ccaacggaga ctaccgtgga ctcggccctt 
tcgatgctca ccggcgaagc gcgggcacgg 
cacgagaagt ggaacagcgc agctgaagta 
gattcggcga cgcacgacga gatattcgag 
gcagttcctg cggaacttca agcgccgaaa 
aaagctcttc ggctatctga agaaggtaac 
gctcgcggcc gagagccgga gtcaggagcc 
gcccggcggc gtcgactctc ccgaagcgct 
catctcggag ttccccgccg accggggctg 
gaaccaccag ggaacgtcgt acacgcgggc 
cgaccccgcc atgttcggga tttcgccgcg 
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gtcgtgctgg gtgcggaccg cggtgagctg 16260 
caggaggctt ctggggtgat cagcggaact 16320 
tcggggcagg gtggtcagtg gttggggatg 16380 
ttcgctgctg cgtttgatga ggcttgcgcc 16440 
cgggttcggg atgtggtctt cggttccgat 16500 
cagtcgggtc tgttcgcgct gcaagccggc 16560 
cggccggatg tggtgatggg gcattcggtc 16620 
gtgttgtcgt tgcgggatgc ggctcggttg 16680 
ctgccctctg acggcgcgat gttggcggtg 16740 
ctggccggtc gggaggagtc cgtgagcgtc 16800 
ttgtcgggcg atcgggaggt gctggccagc 16860 
cggacgcggc gcttgcgggt ctcccatgct 16920 
ggcgagttcg cccagatcgc cgagtctgcg 16980 
tcgacgttga cgggtgagct cgacagagcc 17040 
cgccaggcgc gtgaacccgt ccgtttcgcc 17100 
ataggcacgg tcgtcgagct cggcccggac 17160 
gcgaccgagt ccgatcgggt tgggcggatt 17220 
gacgagaccc gttcggtgat gacagccctg 17280 
gactggcagg cgtttttcgc cggtaccggc 17340 
ttccaacgac agcactactg gatcgagtcc 17400 
atcggcgagg tggcggaaca gttctggacc 17460 
gtcgccgctc tggatcttgg ggcggacgac 17520 
ccggcgttgt cctcctggcg aagcggactc 17580 
taccgaatca gttggcattc ctctcgggag 17640 
ctgttggtcg tgcccggtgc tgcggatgac 17700 
gtcggaggcg gcgccgaggt cgtccggatc 17760 
gacgtcgcac agcggctggc caatgcgctg 17820 
tcgctgttgg ggctcgatga atcgcctgct 17880 
gcgctgactg tgcagcttct gcgggccttg 17940 
gcggtgacgc gcggcggcgt cgcgttggaa 18000 
gtctgggggc tgctgcgtgt cgcgggactg 18060 
gacctgccat cggactggga cgaccgattg 18120 
ggtggcgagg atcaagtcgc cattcgccgt 18180 
gctggtgcgt cgggtgccgg gtcggtgtgg 18240 
ggtacgggcg gtttgggggc gcatgttgcc 18300 
gtggtgttga ccagccgtcg aggagcggac 18360 
ctggaggcgc tgggtgctcg ggtgtcgatt 18420 
gtggctggag tgttggcagg gatcggtggg 18480 
gccggggtcg gcgaggcggg cgacgtagtg 18540 
ttgtcggcga aggtgcgtgg tgcggcgaat 18600 
gatgcgtttg tgatgttctc ctcggtgtcg 18660 
tatgcggctg cgaacgccta cttggatgcg 18720 
gtcgggaccg cggttgcgtg gggaccgtgg 18780 
ggcgcacagc tgcaccggat gggcctggcg 18840 
cttcagggtg cattggaccg cgatgagacc 18900 
cggttcgccc cagccttcac ctcggcacgt 18960 
gcccgagccg cattggaaac caccggcgaa 19020 
acgcaacgcc tggccggact gtcgcggaag 19080 
cgggcggaga cggcggctgt gctgggacgc 19140 
ccgttccagg aactcggatt cgactccttg 19200 
accgccaccg ggatccagct gcccgccagc 19260 
ctgtcgcgtc acctctgcgc cgagcttttc 19320 
gccgagctcg atcgaatcga gcagcagctc 19380 
gaccgaatcg cgacacgact gcgagccctc 19440 
ccgaccggag ccgatgtcct gagcacgctc 19500 
ttcatcgaca acgagctcga cctgtcctga 19560 
tcgggtggaa atcacaatgg ccaatgaaga 19620 
tgcggacctg catcagaccc ggcagcgcct 19680 
gatcgcgatc gtctcggcga gctgccgact 19740 
ctggcaactc gtgcgcactg gcaccgacgc 19800 
ggatctcggc cggttgtacg atcccgaccc 19860 
cggcggtttc ctcgcaggag cgggcgattt 19920 
tgaggcgttg gcgatggacc cgcagcaacg 19980 
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gttgttgctg gagctgtcct gggaggccct 
gcgcggcagc aagaccggtg tcttcggtgg 
gcaggagatg agccgaaacg ctgggggttt 
gtcgggtcgg gttgcgtatt cgtttggttt 
gtgttcgtcg tcgttggtgg ccctgcattt 
cgatctcgcg ctggccggcg gtgtgacggt 
ctcccgtcag cgtggtttgg ctccggacgg 
tggcaccggg tggggtgagg gtgccggtct 
gcggaatggg cacgaggttc tggcggtggt 
gtcgaatggt ttgactgcgc cgaatggtcc 
ggcgagtgcg gggctgtcgg tttccgatgt 
cacgttgggt gatccgatcg aggcacaggc 
gaaggatcgg ccgttgtggt tggggtcggt 
cgctggcgtt gccggcgtca tcaagatggt 
cacgttgcat gtggatgagc ccacgtcggc 
tctcacggag aacacgccct ggccggacag 
gttcgggatc agcggcacca acgcacatgt 
cgaaccggcc gggccggtcg aaggcgagcg 
ttgggtgctg tcgggtaaga caccggaggc 
gcatatcgag gaccggccgg ggctgtcgcc 
acgcgcggcg ctggatgaac gcgcagtggt 
cgggttgagg gcattcgccg acggctgcga 
gcttggtggc cgcgtcgggt tcgtgttctc 
ccgggggctc tactcggtgt ttccggtgtt 
gttggatgca cacctgggcc aggaactgcg 
gtggttgctg gatcggacgg tgtgggcgca 
gctgcggctg ctgggttcgt ggggtgttcg 
tgagctggct gcggtgcatg cggctggtgt 
ggcgggtcgc gcccggttga tgcaggcgtt 
tacgggtgag tttcaggtcg atcctctgct 
ggcggtgaat ggcccggaat cggttgtgct 
cgctgatcgg ttgcacgatc aggggtgccg 
ccattcgccc catatggagc cgatgctgga 
atatcacgca ccggaactgc cgatcatctc 
agtgatgggc actcccgagt actgggtgcg 
gggtgtccag gcgcttgtcg gtcagggtgt 
ggcgttgtcg acgttggtcg aggagtgtgt 
gctgatgcgc aaggaccgcg acgaggcgcg 
cacccgtggt ggtgaggtgg actggcggtc 
cgacctgccc acctacgcct tccagcggca 
gggtgacgtg accgccgccg gattggccga 
tgcgttggca gacggcgaag gtgtggtgct 
gtggttgtcc gatcaccggg tgctgggcga 
gctggtgtgg cacgtcggcg agcgcctcgg 
agcgcccctg atcctgccgg atcatggagc 
cggggaatcc ggagcccggt cggtggcgct 
cgagtggaag aagcacgcga cgggcgtgct 
gctgaccgca tggcccccgg agaatgcgac 
ccttgaaggg cacggtttcg cgtacggacc 
acgaggcggg gaggtgttcg ccgaagtcgc 
tcgattcggc gtccaccccg cgttgctgga 
gtcggtggtc cagagcgaag cgcgggtgcc 
cactgaaagc gcggtggtgc gggcgcgcct 
ggtcgcagtg gacccggctg gccgattcgt 
gatctcccgg cagcaggtga ggtctggcgc 
gcaccggaag gcgttgttgg gaacaaccgc 
tcccagttgg ccggaatcgg tgcgcgcaac 
tgcggccgtg gactcggacg ttcctgcccc 
cgaagaggtc gagggtggat ccctgccgtc 
ggctctcgtg cagtcgtggc ttgcggacga 
cacgcgtgca gcggtgtcgg ccgactcgga 
gtcgtggggg ttgttgagtt cagcccagtc 
cgtggacggc acacctgagt cgtggcaggc 
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cgaacgggcg ggcatagacc cgacatccct 20040 
tgtcacgccc caggagtacg ggccgtcctt 20100 
tggactcacc gggcggatgg tgagtgtggc 20160 
tgagggtcct gcggtgacgg tggatacggc 20220 
ggcgtgtcag tcgttgcgtt ccggcgaatg 20280 
gatggcgaca ccggcgacgt tcgtggagtt 20340 
gcggtgcaag tcgttcgcgg ctgccgcgga 20400 
ggtgttgctg gagcggttgt cggatgcgcg 20460 
gcggggtagc gcggtgaacc aggacggcgc 20520 
gtcgcagcag cgggtgatca cccaggcgtt 20580 
ggatgcggtc gaggcacatg ggaccgggac 20640 
cctgatcgcc acgtacgggc agggccggga 20700 
caagtccaac atcggtcaca cgcaggcggc 20760 
cttggcgatg cggcacgggc agctgcccgc 20820 
ggtggactgg tcggcgggtt cggtccggct 20880 
tggtcgtcct tgccgggtgg gggtgtcgtc 20940 
gattctcgaa cagtctccag tcgagcaggg 21000 
ggaaccggat gtagccgtcc ccgtggtgcc 21060 
tgcgcgggcg caggccgaac gggtgcattc 21120 
ggtggatgtg gcgtattcgc taggaatgac 21180 
gttgggctcg gaccgtgccg cgctcctgac 21240 
tgcgcccgaa gtggtttcgg ggtctgtggg 21300 
gggtcagggt ggtcagtggc cggggatggg 21360 
cgccgacgcg ttcgacgagg cttgcgcgga 21420 
ggttcgggat gtggtgttcg gttcgcaagc 21480 
gtcgggtttg ttcgcgttgc agattggctt 21540 
gccggatgtg gtgttggggc actcggtggg 21600 
gttgtcgttg tcggaggccg cgcggttggt 21660 
gccttctggt ggtgccatgc tcgcggtcgc 21720 
ggatggggtg cgggaccgga tcggtatcgc 21780 
ctctggtgac cgcgagctgc tcaccgagat 21840 
gacccggtgg ttgcgggtgt cgcatgcttt 21900 
ggagttcgcc cagatctccc gaggccgcga 21960 
gaccctgatc ggtgagctgg acggtggtcg 22020 
tcaggtgcgt gagcccgtcc gtttcgccga 22080 
cggcacgatt gtcgaattgg gtccggacgg 22140 
ggcggaatcc gggcgggtgg ccgggatccc 22200 
aaccgtgctg gcagctttgg cgcagatcca 22260 
gtttttcgcc ggtaccgggg cgaagcaagt 22320 
gcggtactgg ctggcatcca ccgggcgtgc 22380 
ggcggaccat ccgctgctcg gtgcggtggt 22440 
gaccggtcgg ttgacagcgg gttcgcatcc 22500 
aatcgtcgtc cccggcaccg cgatcgtcga 22560 
ttgtggccgg gtggaagaac tggctttgga 22620 
ggtccaggtt caggtgctgg tgggaccgcc 22680 
ctactcctgt cctggcgagg cgatcgaacc 22740 
tctcccaccc gtggccgccg agaaccatga 22800 
cgaaatcgat gcagacgggg tctacgcatt 22860 
ggcctttaga tgtctgcgcg gtgcctggcg 22920 
attgccggat gacatgcagg cgggggtcga 22980 
cgcggttctg catgccgccg cagccgagac 23040 
gttctcgtgg cgtggggtgg aacttcgcgc 23100 
ctcgttgact tcggatgacg aactgtcgtt 23160 
ggccacggtt gattcgctgg tgacccgacc 23220 
gatcggtgat tgcctgttcg aggtggagtg 23280 
cggcgacgac cttgccatcg tcggtgacgg 23340 
cgcacggttc gcgaccctgg atgagttccg 23400 
gggttcggtg ttggtcgcag ctatgtcggc 23460 
gcgcgcccaa gagtcgacct ccgatctgct 23520 
gcggttcgcc gaatcccagc tcgtggtcgt 23580 
ttcggacgtc gcggacctgg tgggtgcgtc 23640 
ggagaacccg ggtcgcttcg tgctggtgga 23700 
gttgccggcc gccgtgcgag caggagaacc 23760 
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gcagctggca cttcggcgcg gcgtggcgct 
cgaggagggc tcctccccgc aactcgacac 
cggtgcgttg gggggagtgg ttgcccgtca 
ggtgttggca ggccggcgtg gctggaatgc 
ggcgcgcgcg ggcgccgtgg ttgaggtggt 
ggagcacgtg ctggccgcca ttccggtcga 
tggggtgctg gccgacggag tgatcgggtc 
tgccccgaag gtgacggggg catggcatct 
gttcttcgtt cttttctctt ccttctccgg 
cgcggcggcg aacacgttcc tggatgcatt 
tgggttgtcg ttggcgtggg gactgtgggc 
cgcggcgtcg gtggagcggt tggcgcggac 
actccgcctg ttcgatgccg cgttcgcgaa 
ggacagggcg ctgctggtcg ggaacggacg 
gttggttcct gttcgcggcg gtgtggcgag 
ggacgcactg ttgggtttgg tgcgggagca 
ggtcgaggtt gggggcgacc gtgctttccg 
ggagttgcgg aaccgccttg ccggggtgct 
cgactatccg acgccgcggg cgctggcgcg 
cgcgtccacg tcgacgccgg tgaccagggc 
tgtcgggatg ggatgtcgtt ttccgggtgg 
ggtggccggc ggcgtggatg cggtggctgg 
ggcgttgtac gatcctgatc ccgatcgtct 
tctgcgggac gcggcggagt tcgatgctga 
ggcgatggat ccgcagcagc ggttgctgct 
tgggatcgat ccgttctcgt tgcacggcag 
ccacgactat ggggcccgat tcattaccag 
gacgggcaat gcggggagcg tgctgtcggg 
tcctgcggtg acggtggata cggcgtgttc 
tcaagcactg cgggccggtg agtgcgaatt 
gacgccgacg acgttcgtgg agttctcccg 
caagtcgttc gcggcggccg cggatggcac 
gctggagcgg ttgtcggatg cccggcgcaa 
tagcgcggtg aaccaggacg gcgcgtcgaa 
gcaaagggtg atcacccagg cactcacgag 
tgtggaggcg catgggacgg gcacgcggct 
cgctacgtac ggccgggatc gtgatcccgg 
gaatattggt cacacccagg cggcggcggg 
gatgcggcag ggggagctgc cgcgcacgtt 
ctggtctgcg ggcacggtcc aactcctcac 
tcttcgccgg gcgggcgtgt catcgttcgg 
tgaacaacct ccgcgagagt cgcagcgctc 
ttttccggtg gtgccgtgga tggtgtcggg 
agatgcattg atgtcctact tgagcaatcg 
ttcgcttgcg gtgacccgtc cggcgttgga 
tgccgcgttg ctgccgggct tgaaagcgct 
caccggcact cgtgccgctg ggccggtcgg 
gcccgggatg ggaagcgggc tccactcggc 
agcctgctgc gagctggatg cgcatctcgg 
cggttcggat acgcaacttc tggaccagac 
gcaagtcgga ctctgggagt tgttgggttc 
ccactcggtc ggtgagctgg cggcggcgtt 
ggctcggctg gtggcgggcc gtgcccggtt 
gctcgctgcg gctgctggag aggagcagct 
tgtggggatc gccgcggtca acgctcccgg 
gctcgatgac attgccggtc ggctggacgg 
ttcgcatgcg tttcattcgc atcggatgga 
ccggagcgtg gactaccggt cgtcagggct 
cgatgaggtc ggcatgccgg ctacgccgga 
ccgcttcgcc gacggtgttg ctgcgctcgc 
cggtccggat ggggtgttgt cggcgctggt 
cggacgggtg gccgcggttc cgctcatgcg 
aacggcattg gcgcagatcc atgtgcgtgg 
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ggtgcctcgg ttggcgcgac tcacggtgcg 23820 

ggacgggacc gtcctcatca cgggtggcac 23880 

cctggtggag gagcacggga ttcggcgttt 23940 

gcctggagtc cacgagttgg tggatgagct 24000 

ggcttgcgat gtggctgacc gcaccgatct 24060 

ctggccgctg cgggggatcg tgcataccgc 24120 

cttgtcggcg gcggatgtgg gcacggtgtt 24180 

gcacgagttg acccgcgatc tggatctgtc 24240 

gattgcgggt gccgcagggc aggccaacta 24300 

ggcgcgttat cgccgggcgc gtgggctgcc 24360 

gcaacccagc ggtatgacga gtggcttgga 24420 

gggcatcgca gaactttcca cggaggatgg 24480 

ggaccgggct tgcgtcgttg ccgctcgatt 24540 

atcgcacgcg attccggcgc tgttgagcgc 24600 

gaaaacagcc aattctcagg ccgcggatga 24660 

cgtttcggcc gtgctgggtt attcgggtgc 24720 

tgatctgggt tttgattcgt tgtctggcgt 24780 

gggggtgcgg ttgccggcga ctgcggtgtt 24840 

tttcctgcat caggaactgg caggcgaggt 24900 

agcgagtgcc gaagaggatc ttgttgcgat 24960 

ggtgtcgtcg ccggaggagc tttggcggct 25020 

gttcccagac gatcgcggct gggatctcgc 25080 

cgggacctcg tatgtgtgtg agggcgggtt 25140 

catgttcggc atcagcccgc gtgaggcgtt 25200 

ggaggtcgcc tgggaaacct tggagcgggc 25260 

ccggaccggt gtgttcgcgg gcttgatgta 25320 

agcaccggag ggcttcgaag ggcacctcgg 25380 

tcgggttgcg tattcgtttg gtttcgaggg 25440 

gtcgtcgttg gtggcgttac acctggcggg 25500 

cgcccttgcc ggtggcgtca cggtgatgtc 25560 

tcaacggggt ctggctccgg atgggcggtg 25620 

cgggtggggc gagggtgccg gtctggtgtt 25680 

tgggcacgag gttctggcgg tggtgcgggg 25740 

tggcttgact gcgccaaatg gtccgtcaca 25800 

tgccgggctg tccgtgtccg acgtggatgc 25860 

tggtgatccg atcgaggcgc aggcgttgat 25920 

tcggccgttg tggctggggt cggtgaagtc 25980 

tgtcgctggt gtgatcaaga tggtgatggc 26040 

gcacgtggac gagccctccg cgcaggtgga 26100 

ggagaacacg ccctggcccg acagcggtcg 26160 

gatcagtggc accaacgcgc acctgatcct 26220 

aacagagccg gattcgggtt ctgtccgcga 26280 

caaaacaccc gaagcgctat ccgcccaggc 26340 

cgttgatgct tccccgcgag atatcggtta 26400 

ccaccgcgct gtcgtgctgg gtgcggatcg 26460 

ggccgttagt aatgacgctg ccgaggtgat 26520 

attcgtgttc tccggtcaag gtggtcagtg 26580 

gtttccggtg ttcgccgacg cgtttgacga 26640 

gcagatggcc cggctacgag atgtgttgtc 26700 

cttgtgggcg cagccgggcc tgttcgcgtt 26760 

gtggggtgtc cggcccgctg tggtgctggg 26820 

cgcggctgga gtgttgtcgt tgcgggatgc 26880 

gatgcaagcc ctgccaactg gcggtgccat 26940 

gcgcccgttg ctggccgact gcggtgatcg 27000 

gtcggtggtg ctctccggtg atcgggatgt 27060 

gcaagggatc cggtccaggt ggttgcgggt 27120 

tccgatgctg gcggagttca ccgaaatcgc 27180 

gccgatcgtg tcgacgttga cgggtgagct 27240 

gtattgggtg cgccaggtgc gagaacccgt 27300 

ggctcacggt gtgagcaccg tcgtcgaggt 27360 

gcaggagtgc gcggccggat ccgatcaggg 27420 

cagcaatcgc gacgaggcgc acacggtgac 27480 

tgctgaggtg gactggcggt cgtttttcgc 27540 
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cggtaccggg gcaaagcagg tcgagctgcc 
gcttgactca ccatccgaac cggtcgggca 
ctgggaactc gtcgagcagg aagatgtcag 
cgatcacgac gtgcaggcgt ccctggaatc 
ccggatccgc aacgaatccc tggtgcacca 
ggcagatttg ccagacccct cgttgtcggg 
gtcggcgagt cggcaagttc tgcgtttcaa 
agttctgttc gagctcgccg ggcacgacga 
gcctgttgcg tcagggggaa taagcggcgt 
gtcctcgccg aacgctgctt tgccgaatgg 
tctgcgggcc gcggatgtgt cggcgccatt 
cggggatgtg ccggtgaacc cggggcaggc 
tctggagcat ccggcctggt ggggtggcct 
cgctcgagaa cgcttgtcgg tcgtgttggc 
tcccggtggt gtgttcgtgc ggcggttgga 
ggtgtggcgt cctcggggga cggtgttggt 
tgttgcccgg tggttggcgg gtgccggggc 
cgcggcggct ccgggcgctg gagatttgcg 
ttcgatcacg gcctgcgacg tggccgatcg 
tccggatgat tgcccgctga ccgcggtgat 
cgtggcgtcg atgtgtttga ccgacttcgt 
ggcgaatctc gatgagttgc tcgccgatgt 
cgtctcgggt gtgtggggtg ctggcgggca 
ggatgcgttg gcgcagcagc gtcgggcaag 
cccgtgggcc ggtgacggaa tggccgcagg 
cctggtgcca atggctgcgg atcgggcgtt 
cgagacatcc ctggtcgtgg ccgatatggc 
gtcccgtcgg cgtccgctgc tcgacgagct 
ggagaacacc actgatgctg cggactcggc 
agccgccgaa cgccgccgcg cgatgctgga 
gggacacaac gggtctgacc cagttggtcc 
ttcgctgatg gccgtcgaac tgcgcaacag 
ggccacgttg atcttcgatt atccgagccc 
gctggtggga gcgcagcccg cgaccaccgt 
ggttgtcgtg gtcgcgatgg gatgccggta 
gtggcagctg gtttctgcgg gacgtgatgc 
ggactgcaac acgttgttcg acccggatcc 
aggtgccttc ctgaccggtg ctgatcggtt 
cgaggcgcgc gcaatggatc cgcagcagag 
cgaacgagca ggaatcgctc cgctgtcgtt 
gaccaatggg caggaccacg gtgcgaaagt 
cctgaccgga aacgccgcga gtgtcctggc 
ggggcctgcg gtggcggtgg ataccgcgtg 
gtgccagtcg ctgcgttcgg gtgagtgtga 
gtcgacaccc ctggctttcc tcgagttctc 
gtgcaagtcg tttgcggccg ctgcggatgg 
gttgctggag cggttgtcgg atgctcgtcg 
cgggtctgcg gtgaatcagg atggtgcgtc 
gcagcagcgg gtgattcggc aggccctcgc 
tgtcgtggag gcgcacggga ccggtaccgg 
gatcgcgaca tatgggcagg agcgggatcc 
gtccaacatc ggccacacgc aggcggcggc 
ggccatgcgg cacggggagt tgcctgcgac 
ggactggtct gccggggccg ttcggctcct 
ccgtcctcgt cgagcggggg tgtcgtcgtt 
cctcgaacaa ccaccgtcgg aaccagcgga 
gcatccagcg gtgatcccgt ggatgttgtc 
agcggctgcg ctgcaggccc ggctggaccg 
gtattcactc gcgaccactc gttctgtgct 
tcgggaggca ctgctgtcca ggctggcagc 
gataacgggc tctgcgaatt ccggtggccg 
tcagtggctg gggatgggaa aggcgttgtg 
cgaggaagcc tgcgacgcgc taagcgcaca 
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cacgtatgcc ttccaacgac agcggtactg 27600 
atccgccgat cccgcgcgcc agtcgggctt 27660 
cgcgctcagc gccgctctgc acattaccgg 27720 
ggtggttccg gtcctctcct cctggcatcg 27780 
gtggcggtac cggatttcct ggcatgagcg 27840 
gacatggctc gtcgtcgtgc cggaggggtg 27900 
cgagatgttc gaggaacggg gttgcccggc 27960 
ggaagccctg gcgcaacgat tccgctcgtt 28020 
gttgtccttg ctggcgctgg atgaatcgcc 28080 
cgcgctgaac tcgttggtac tgctgcgagc 28140 
gtggttggcg acgtgtggtg gtgtcgcggt 28200 
gctggtgtgg ggactgggtc gcgtcgtcgg 28260 
ggtcgacgtg ccgtgcttgc tcgatgagga 28320 
aggtcttggc gaggacgaga tcgcggtacg 28380 
acgcgctggt gcggcgtcgg gtgccgggtc 28440 
gacgggtggt acgggcggtt tgggggcgca 28500 
tgagcatgtg gtgttgacca gccgtcgagg 28560 
ggcggagctg gaggcgctgg gcgctcgggt 28620 
tgacgctttg gccgaagtgt tggcgaccat 28680 
gcatgcggcg ggggtcgttg aagtcggcga 28740 
tggggtgctg tcggcgaagg caggtggtgc 28800 
cgagctggat gccttcgtgc tgttctcatc 28860 
gggcgcttat gcggcggcga atgcctactt 28920 
ggggttggtg gggactgcgg ttgcgtgggg 28980 
tgaaggcggt gcacagctgc gccgggccgg 29040 
gctggcactt cagggcgcat tggatcgtga 29100 
gtgggagagg ttcgccccgg tgttcgccat 29160 
gcccgaagca cagcaggcgt tggcggatgc 29220 
cgtcccgcta ccgcggctcg cgggcatggc 29280 
cctggtgctg gcggaggcct cgattgtgtt 29340 
cgaccgggcg ttccaggagc tcggatttga 29400 
gttgggcgag gcaacaggat tgagtctgcc 29460 
atccgcgctg gctgagcagc tggtcggcga 29520 
cgtggccggg gccgatccag tggatgatcc 29580 
tccgggcgac gtctgctcgc ccgaggagct 29640 
ggtatcgacg ttccccgtcg atcggggttg 29700 
ggatcgggca ggcagtacct atgtgcgaga 29760 
cgacgccggg ttcttcggca tcagccctcg 29820 
gttgttgctc gaagtggcgt gggaggtttt 29880 
gcggggtagc aggaccggtg tgttcgcggg 29940 
ggctgccgcg ccggaggcgg cgggtcacct 30000 
cggccggctt tcctacacgt tcggccttga 30060 
ttcgtcgtcg ttggtggcgt tgcatttggc 30120 
tatggcgttg gcaggtggtg tgacggtgat 30180 
tcgtcagcgc ggtttggcgc cagatggtcg 30240 
caccgggtgg ggtgagggtg ccggcctggt 30300 
gaatggtcac cgggtgttgg ccgtggttcg 30360 
gaatggcctg actgcgccga atggtccgtc 30420 
gaatgcgggg ctgtcggcgt ccgatgtgga 30480 
gctcggggat ccgatcgagg cgcaggcgct 30540 
tgagcgggcc ctgtggctgg ggtcgatcaa 30600 
cggtgtggcg ggggtcatca agatggtgca 30660 
gttgcacgtg gacaagccca ctccacaggt 30720 
caccgggaac acgccctggc ccgagagcgg 30780 
cgggatcagc ggcaccaacg cacacctcat 30840 
gatcgaccaa tcggatcggc gggtcactgc 30900 
ggctaggagt ctcgcagcgc tgcaggccca 30960 
gggtcctggc gcttctccgc tggatttggg 31020 
ggacgaacgc gccgtcgtgt ggggtgccga 31080 
gctcgccgat ggccggacgg cgccgggggt 31140 
catcggattc gttttttccg gtcagggcag 31200 
cgcggctttc ccggcgttcg cggacgcctt 31260 
cctgggcgcg gacgttcggg gtgtgctgtt 31320 
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cggtgctgat gagcagatgc tcgaccggac 
tcaagtcggc ctcctgggat tgctgaggtc 
gcactcggtc ggcgagttgg ctgcggcgca 
tgcacggttg gttgcggctc gggcccacct 
gctcgcggtc gccaccagcg aggcggcggt 
ggtcagcatc gctgcgatca acggccccga 
gctcgtggag ctcgcaggcg aattcgatgc 
ctcccatgct ttccactcgc accggatgga 
caggtgcgtc gagttcggtg aaccggtggt 
ggacaccacc ggactgatgt gcgcggccga 
ccgcttcgga gacggtgtcc gggcgctcgt 
cggcccggac ggggcgttgt cggccctggt 
tgggagggtg gcggcgatcc cgctgatgcg 
ggcggccctg gcgcacgtgc acgtccgcgg 
cggcaccggc gcccgcaccg tcgagttgcc 
gctggccggg caagcggacg ggcgcggcgg 
cttctgggag ttggtcgagc gcgccgatcc 
ccgggaccag cccttccggg aggtgctgcc 
ccaggaggcc ctcgcggatt cctggcgcta 
gtccgcagcc gccctccggg gcgtgtggct 
tcaaccggcg gtcgtcatcg acgcgctgat 
attgaccgag caggacctcc aacgcagtgc 
ggaccgcacc gaggtgacgg gtgtgttgtc 
ggcgcatccg cacctgtccc gtggtgtcgc 
cgatgcgggt gtttccgccc cgctgtggct 
cgaggacggt ccggccgatc cggaccacgg 
ccttgaacat ccgcagtggt ggggtggcct 
gtcccggaac gggttggtgg ccgcactcgc 
gcgttcatcc gggttgttcg ttcgcagagt 
gacatggcgt agccggggaa cggtcctcat 
ggtcgcacga tggctggccc ggcggggagc 
cccggaagct cccggcgcag cggacctagg 
cacagtcttg gcctgcgatg tgacggaccg 
tcccacggag tatccgctgt cggcggtcgt 
cctggccgag acgaccttgg cgcagttcgc 
ggcgaacctg gaccggctgc ttggcgggca 
gatctcggga gtttggggag ccggcggcca 
cgatgccctt gccgagcgcc gacgggcttg 
tccgtgggcg ggtgcgggca tggccgttca 
gggcctggta ccgatggaac cgcagtcggc 
acgagaaacc gccatcaccg tcgcagatgt 
cgcggcccgc ccgcgaccac tgttggaaga 
cgaggagaag cacggtgccg gcgagctggg 
gcgcggacac ctgctgctgg aggtggtgct 
ttcggcggag gctgtgcaac ccgatcggac 
cgcggtagag ctgcgcaaca ggttgaacgc 
ggttttcgac cacccgacgc cgctggcgtt 
ggagccggac aacggcatcg aatcgctgct 
ggcgcaaggg ccttcgatcc cactggaaga 
actcctcgcc aagtgggacg gggcgcgtga 
gctgacggcg gccacggacg acgaaatctt 
accgcccttt cctcgcctca gctcccctga 
gaagaaaagc tccgcgagta cctcaagcgt 
cgcctgcacg agttggagcg ccaggagcac 
cgttatcccg gtggcgtctc cactccggag 
gacgcgatcg cgaacttccc cgaagaccgt 
gatccgggcc gagccgggac ctcctacgtc 
gacttcgatg ccgggctctt cgggatcagt 
cagcggttgc tgctggagat ctcgtgggag 
tctttgcggg gtaccaagac cggtgtgttc 
cggtttcgca agacccccgc ggagttcgag 
gtcgcatccg gccgggtggc ttacaccttc 
accgcctgct cgtcgtccct ggtggcgctg 
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gctgtgggcg cagtcgggga tcttcgcggt 31380 

gtggggcgtg cggccggccg cggtgctggg 31440 

cgcggctggt gtgttgtcct tgccggacgc 31500 

gatgcaggca ttgcccaccg gcggcgcaat 31560 

cggaccgctg ctttccgggg tgtgcgatcg 31620 

gtcggtagtg ctctccggcg accgcgatgt 31680 

ccgagggctt aggaccaaat ggttgcgggt 31740 

accgattctg gacgagtacg cggaaaccgc 31800 

gccgatcgtc tccgccgcga ccggtgcgct 31860 

ctactggacg cgccaagtgc gtgatcctgt 31920 

cggccaaggc gtggacacga tcgtcgagtt 31980 

cgagcagtgc ttggccgggt ccgaccaggc 32040 

cagggaccgc gatgaggtcg agaccgcggt 32100 

tggtgcggtg gactggtcgg cttgcttcgc 32160 

cacctacgcc ttccaacgcc agcggtactg 32220 

cgatgtggtt gccgacccgg tcgacgcgcg 32280 

ggaaccgttg gtggatgaac tctgcatcga 32340 

cgttctggct tcctggcgcg agaaacaacg 32400 

ccaggtgcgc tggaggtccg tcgaggtgcc 32460 

ggtggtgctt ccagctgacg tgccccgaga 32520 

cgcgcgcggc gccgaggtcg cggtcctgga 32580 

gcttgtggac aaggtgcgcg ccgtcattgc 32640 

tctgttggcg atggacggca tgccctgcgc 32700 

cgctaccgtg atcctgacgc aggtgttggg 32760 

ggccacgacc ggtggcgtcg aggccgggac 32820 

cttgatctgg gggctcggca gggtcgtcgg 32880 

gatcgacctt ccggagacac tggacgagac 32940 

cgggacggcg gccgaagatc agctcgccgt 33000 

ggtgcgcgca gcgcggaacc cccggtcaga 33060 

cacgggcgga acaggcgcgc tcggtgccga 33120 

tgagcacctg gtgttgatca gtcgccgcgg 33180 

ggccgagctg actgaactcg gcgtgaaagt 33240 

cgacgagctg gcggcggtgc tggcggccgt 33300 

gcacaccgcc ggcgtcggga cgcctgcgaa 33360 

cgacgtgttg tcggccaagg tcgtcggcgc 33420 

accgttggac gccttcgtgc tgttctcctc 33480 

aggagcctat tcggccgcca atgcgtatct 33540 

cgggcggccg gcgacgtgca tcgcctgggg 33600 

ggaaggtaac gaggcgcatc tccgccgaag 33660 

cctcttcgcg ctgcaacagg ccctgtccca 33720 

ggactgggag cgattcgccg cctctttcac 33780 

gatcgtggat ctacggcccg acaccgagac 33840 

gcagcagctg gccgcactgc cgcccgctga 33900 

ggcggaaacc gccagcaccc tggggcacga 33960 

cttcgccgaa ctgggcttcg attcgctgac 34020 

ggtgaccggg cttcgcctgc cgccgacgct 34080 

gtccgaacag ttggttccgg ccctggtcgc 34140 

cgccgagctc gacaggctgg ataccacgtt 34200 

ccaggccaag gtggcggagc gcttgcacgc 34260 

cggcacggcc agagcgacgt caccccaatc 34320 

cgacctcatc gaccggaagt tccggcgctg 34380 

ttactggaac ggtgtatttc gatggccaat 34440 

gtcgtcgtcg aactggaaga ggcgcacgaa 34500 

gaccccatcg cgatcgtgtc gatgggatgt 34560 

gagctgtggc gactggtcgt cgacggagga 34620 

ggctggaatc tggacgagct gttcgatcct 34680 

cgcgagggtg gtttcctgcg cggggtcgcg 34740 

ccgcgcgagg cacaggcgat ggacccgcaa 34800 

gtgttcgagc gcgccggcat tgacccgttt 34860 

gcgggcctga tctaccacga ctacgcgtcg 34920 

ggttacttcg ccaccggcaa cgcgggcagc 34980 

gggttagagg gcccggcggt caccgtggac 35040 

cacctggcct gccagtccct gcggctgggc 35100 
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gaatgcgacc tggccctggc cggtggcatt 
gagttcagcc ggcaacgcgc actcgcctcg 
gccgacggca ccggctgggg cgagggcgcc 
gcacgacgaa acggccaccc ggtgctggcg 
gggacgtcca acggcctgac cgcgcccagc 
gccctggcga acgccgggtt gtcgcccgcc 
ggcacggcct tgggcgaccc gatcgaggcg 
cggtcggcgg atcatccgct gctgctgggt 
gctgccgccg gtgtggccgg ggtgatcaag 
ccccgcagcc tgcacatcga ccagccatcg 
cggctgctca cggacagcgt tgactggccg 
tcctcgttcg gcatgagcgg taccaacgca 
ccggtctcgg gcagtaccga gccgaccggg 
gagacggcat tgcgcgagca ggctgccgag 
ccgggactgg gggacgtcgg gtactcgctg 
gctgtcgtgg ttgccgacga tcgggactct 
ggcgttccgg cagccaacgt ggtgcagggc 
gtgttccccg gccagggctc gcattggcag 
ccggtgttcc ggcggaagct ggcggaatgc 
tcgctgctcg gcgtccttcg cggtgatccc 
attcagctcg cgctgttcgc catgatggtg 
gtggagcccg ccgcggtggt cggtcattcc 
ggcgctttgt ccttgactga tgcggtgcgc 
gcgctgaccg ggaagggagg catgctcgcg 
cgaatcgcag gcctgccgga gctgaccgtt 
gtttccggcg aaccgtcggc tctggagcgt 
cagacccggc gggtgggaat tgattacgcc 
ggccggcttc tggaccggct gggcgaagtc 
tcgacggtca ccggcgagcg gacggacacc 
aaccttcggc agcccgtccg cttccagcag 
cggttcttcg tcgaggtgag cccgcacccg 
gaagccgcgg acgcgggcgg ggtggtggtc 
cggcgctggc tgacttcgct ggccgagtgc 
caggtattcc tcaacaccgg agcccgacgc 
cagcggtact ggttggagtc cgccgagtac 
ctctccgccg agcatcccct gctcggggct 
ctgctgaccg gcaagctgtc ggtcaagacc 
ggggcgatcc tgctgcccgg caccgcgttc 
gtcgggtgcg atctgatcga ggagttgtcc 
ggtgcggtgc aggtgcagat cgcggttggc 
cgcgtgcatt cctgtcgaga cgacgccgtg 
ggcacgttga cctccagcga tcaccaggac 
cccaacgatg ctgtcgcggt tccgctggac 
ttcgatttcg gcccggcgtt ccaggggttg 
ttcgccgagg tcggcctgcc caccgcacac 
cctgctctgc tggatgcggc actgcaggcg 
ggatggctcc cgttcgcgtg gcaaggtgtg 
cgggtgcacc tcgttccggc gggcgcgaat 
ggccaagccg tgctctccat cgattcgctg 
gcagcggccc gtgcgatgga acacgagtcc 
cccggcgctg ccaagccggt ctcctgggca 
tgcggctcgg cacttggcac ggaactccac 
ccggacgtcg tggtggtgcc atgcggtgcg 
gcacgtgccg cgacacaacg catgcttgac 
ttcgccggat ctcgcctggt ggttgtgacg 
ggtgtttccg acctggtgca tgctgcgtcg 
aacccggacc gattcgtgtt ggtcgatgtg 
gcggcggccg tgcgttccgg agaaccgcag 
cctcgcctgg cgcgatgtgt tgccgccgag 
gggacggtgt tgatttccgg cggtacgggc 
gtggcggagc gcggtgtccg ccgcctggtg 
ggggtcaccg acctggtgga tgagttggtg 
tgcgatgtcg gggatcgggc ccagttggac 
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tcggtgatgg ccacgccggg agccttcgtc 35160 

gatggccggt gcaagccctt cgcggatgcc 35220 

ggaatgctgc tgctggaacg gctgtcggac 35280 

gcggtggtcg gttccgcgat caaccaggac 35340 

ggtcccgcac agcagcgagt gatccgccaa 35400 

gaggtcgatg tggtcgaggc gcacggcacg 35460 

caggccctga tcgccaccta cggggcgaac 35520 

tccctcaagt cgaacatcgg ccacacccag 35580 

tcggtcctgg ccatcaggca ccgggagatg 35640 

cagcacgtgg actggtcggc gggcgcggtg 35700 

gatctcggca ggccgcgccg agcaggggtg 35760 

cacctgatcg tcgaggaagt atccgacgag 35820 

gcatttccct ggccgctgtc cggcaagacg 35880 

ttgctctccg tagtgaccga gcacccggag 35940 

gccaccggtc gcgctgcgat ggagcaccgg 36000 

ttcgtcgccg gactgacggc gttggctgcg 36060 

gcggccgact gcaagggaaa ggtcgcgttc 36120 

gggatggcga gggaactgtc cgaatcctcg 36180 

gcggcggcta cggcccctta cgtggactgg 36240 

gatgcacccg cgctggatcg cgacgacgtg 36300 

tcgctggccg aactgtggcg ttcgtgcgga 36360 

cagggcgaga tcgccgccgc ccatgtggca 36420 

atcatcgctg cccgctgcga tgcggtgtcg 36480 

attgccttgc cggaaagcgc ggtggtgaag 36540 

gcggcggtca acggacccgg ctccactgtc 36600 

ctgcagaccg aactgaccgc ggaaaacgtg 36660 

tcgcattcgc cgcagatcgc gcaggtccag 36720 

gggtccgaac ctgctgagat cgctttctac 36780 

ggccgactcg acgccgacta ctggtaccag 36840 

accgtcgccc ggatggcaga tcagggctat 36900 

ctgctcaccg ccggaatcca ggaaacgctg 36960 

ggttcgctgc ggcgtggcga gggcggctcc 37020 

caggtgcgcg gactgccggt gaattgggaa 37080 

gtgccgctgc cgacctaccc gttccagcgg 37140 

gacgcgggcg atctcggttc ggtgggcttg 37200 

gcggtgacgc tggccgatgc gggcgggttc 37260 

cagccctggt tggccgacca cgtggtcggc 37320 

gtggaaatgc tgatacgcgc cgcggaccag 37380 

ctgacgactc cgctggtttt gcccgcgacc 37440 

ggtccggacg aggccgggcg ccgctcggtc 37500 

ccgcaggact cgtggacctg ccacgcgacc 37560 

gccggccagg gccccgatgg gatttggccg 37620 

agcttctacg cccgcgcagc tgagcggggc 37680 

caggcggctt ggaagcgcgg agacgagatc 37740 

cgcgaagacg ccggcaggtt cggaatccac 37800 

ctgggcgcag ccgaagagga tccggacgag 37860 

tccctcaaag cgacgggcgc actttccctt 37920 

gcggtgtcgg tgttcacgac cgacacgact 37980 

gtgctgcgcc agatttcgga caagcagttg 38040 

ctgttccggg tcgactggaa gcgaatctcg 38100 

gtgatcggca atgacgaact cgcccgagcc 38160 

cccgacctga ccgggttggc tgacccgccc 38220 

tctcgccagg acttggacgt tgcttccgag 38280 

ctgatccagg attggttggc ggcggcgcga 38340 

tgtggtgcgg cgtcgacagg tcccgccgag 38400 

tggggtttgt tgcgttcggc gcagtcggag 38460 

gacggaaccg ccgaatcatg gcgtgcgctc 38520 

ctggcgttgc gcgccggtga agtccgggtg 38580 

gacagccgga tcccagtgcc cggtgcggat 38640 

ctgctgggcg ggttggttgc ccggcatttg 38700 

ctcgcggggc gacgcggctg gagcgccccc 38760 

ggcctgggag ctgcggtcga ggtggcgagc 38820 

cggctgctga cgacgatctc ggcagagttc 38880 
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ccgctgcgcg gagtggtgca tgcggccggg 
acaccagagc acgtggcaaa ggtgttcggc 
gagttgactc ttgatctgga tctctcgttc 
gcgggggctg cgggtcaggg aaactacgcg 
cagcaccggc ggacggcggg gctgcctgcg 
cccagcggga tgaccggagc gctcgatgcg 
ccgccgatgt ccgcgccgga cgggttgcgg 
gaatcgcttc tggttccggt ccacgtcgac 
ggcggtgtgc ctgcgttgtt gcgcgacctg 
aacgagtcgg cggacgtcaa cggtctggtt 
caggaaaccc agctgttggg tttggtgcgc 
ggtgcggtcg aggtcggggc cgatcgtgct 
ggtgtggagt ttcggaaccg gcttggcggg 
gtgttcgact atccgacacc gcgggcgttg 
ggcgtggagg ctccgactcc cgcaccggcg 
gttgtgatcg tggggatggg ctgtcgttat 
tggcgtttgg tggccggggg cttggatgcg 
gatcaggcgg ggttgttcga tccggatccc 
ggtggcttcc tgcgagatgc ggcagagttc 
gaggcgttgg cgatggatcc gcagcagcgg 
gagcgggcgg ggattgatcc gctttcgttg 
ctgatgcacc acgactacgg cgcgcggttc 
tatctaggta atggcagcgc gggaggcgtg 
ttcgagggtc ctgcggtgac ggtggatacg 
ctggcgggtc aagcactgcg gtctggtgag 
gtgatggcca cgccggggat gttcgtggag 
gggcggtgca agtcgtttgc ggcggctgcg 
ttggtgttgt tggagcggct gtcggatgcc 
gtgcggggta gcgcggtgaa tcaggatggt 
ccctcgcagc agcgggtgat cacgcaggcg 
gtggacgccg tggaggcgca tgggactgga 
gctctgattg ccacttacgg gcaggggcgg 
gtgaagtcga atattggtca tacgcaggcg 
gtgatggcga tgcggcacgg gcagctgccc 
gaagtggatt ggtcggcggg ggatgtccag 
aacagccatc ctcggcgggt gggcgtgtcg 
gtcatcctcg aacaagcctc gaaaacacca 
tcggaatcga ccgtggacct tccagcggtc 
gcgctcagcg ctcaggcgag cgcattgttg 
acgctggatg cggcgttttc gttggcttcc 
gtgctgggag cggaccgcga aacgttgttg 
gaggcttctg gggtggtgtc gggatccccg 
ggtcagggcg gacagtggtt ggggatgggc 
gctgacgcgt ttgacgaagc atgtgccgga 
gtccgggatg tggtgtttgg ttccgacggg 
tcgggtttgt tcgcgttgca ggttggtttg 
ccgggtgtgg tgctgggcca ttcggtcggc 
ttgtcgttgc cggatgcggc tcggatggtg 
ccttctggcg gtgccatgtt ggcggtggct 
gccgatcggg ttgatggtgc gggtatcgcc 
tccggcgatc gggaggtgct tgacgacatc 
tggcggcggt tgcgggtttc gcatgcgttt 
gagttcgccg aaatcgcacg cagcgtggac 
acgttgacgg gtgagctcga caccgcaggt 
caggttcgag agcccgtccg cttcgccgac 
gccacgatct tcgaactcgg ccctgatgcg 
tcgtgggctg atcaggccat gccgattccg 
actgtggtcg ccgcggtggc gcgggcgcac 
tatttcgccg gcaccggggc acggcgggtc 
cggtactggc tggaaacatc ggattacggc 
gagcatccgt tgctgggggc cgtggttgcg 
ggccggttgt cggtggggac gcatccgtgg 
gtcgtccccg gcaccgccat cctggagatg 
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gcacttgccg acggggtcgt cgagtcgctg 38940 
ccgaaggccg ccggtgcgtg gcacctgcac 39000 
ttcgtgctct tctcctcgtt ctccggcgtg 39060 
gcggcgaacg cgttcctgga cggcctggct 39120 
gtgtcgctgg cttggggctt gtgggagcag 39180 
gcgggccgta gccgcattgc gcgcaccaat 39240 
ctgttcgaga tggcgtttcg cgttccgggc 39300 
ctgaacgccc tgcgcgctga tgcggccgac 39360 
gtgccagcgc ccgtgcggcg gagcgcggtc 39420 
ggtcggctgc ggaggctgcc ggacctggat 39480 
gagcatgttt cggcggtgct ggggcattcg 39540 
ttccgggatt tgggttttga ttcgttgtcc 39600 
gtgctgggcg ttcggttgcc ggctactgcg 39660 
gttcggttct tgctcgacaa actgattggt 39720 
gctgtggcgg cggtgactgc tgacgatccc 39780 
ccgggtgggg tgtcctcgcc ggaggagctt 39840 
gtggcggagt tcccggacga tcgtggctgg 39900 
gatcgtcttg ggacctcgta tgtgtgtgag 39960 
gatgccggtt tcttcgggat ttccccgcgt 40020 
ttgctgctgg aagtcgcttg ggaaaccgtg 40080 
cgggggagcc ggaccggcgt gttcgcgggg 40140 
atcacgaggg cgccggaggg tttcgagggt 40200 
ttttcgggtc gggttgcgta ttcgtttggt 40260 
gcgtgttcgt cgtcgttggt ggcgctgcac 40320 
tgtgatctgg ctcttgcggg tggtgtgacg 40380 
ttttcgcgtc aacggggctt ggcggcggat 40440 
gatggcaccg gttggggaga aggcgcgggc 40500 
cggcgcaacg ggcacgcggt tctggcggtc 40560 
gcgtcgaatg gtttgacggc gccgaatggg 40620 
ttggcgagtg ctggtttgtc ggtgtctgat 40680 
accaggcttg gtgatccgat tgaggcgcag 40740 
gatagcgatc ggccgttgtg gttggggtcg 40800 
gcggcgggtg tcgctggtgt gatcaagatg 40860 
gcgacgttgc atgtggatga acctacgtcg 40920 
ctcctcacgg agaacacccc ctggcccggc 40980 
tcgttcggga tcagcggcac caacgcacac 41040 
gacgagactg cggacaagag cggtcccgat 41100 
ccgttgatcg tgtcggggag aacaccggca 41160 
tcctatttgg gtgagcgtgg cgatatttcc 41220 
tcccgggccg cgttggagga gcgggcggtg 41280 
tccgggttgg aagcgctggc ttccggtcgc 41340 
gtctctggcg gggttgggtt cgtgttcgcc 41400 
cgggggctct actcggtttt tccggtgttc 41460 
ctggacgcgc atctggggca ggacgtgggg 41520 
tccttgttgg atcggacgct gtgggcccag 41580 
ctgagcctgc tgggttcgtg gggtgtccgg 41640 
gagttcgcgg cggcggttgc ggcgggagtg 41700 
gcgggtcgtg cccggttgat gcaggcgttg 41760 
gctggtgagg agcagctgcg gccgttgttg 41820 
gcggtcaacg ctcctgagtc ggtggtgctc 41880 
gccggcgcgc tggatgggca agggattcgg 41940 
cattcgtatc ggatggaccc gatgttgcag 42000 
taccggcgtg gcgacctacc ggtcgtgtcg 42060 
gtgatggcta cgccggagta ttgggtgcgt 42120 
ggcgtccggg tgctcgcgca gcaaggggtc 42180 
acgctgtcgg ccctgattcc cgattgtcat 42240 
atgctgcgta aagaccgtac ggaaaccgaa 42300 
acgcgtggtg ttccggtcga atggtcggcg 42360 
gagttgccga cgtatgcctt ccagcggcag 42420 
gatgtgacgg gtatcggcct ggctgcggcg 42480 
ctggccgatg gtgatgggat ggtgctgacc 42540 
ctggcccagc atcgcgtgct gggcgaggtc 42600 
gccctgcacg caggggcgcg tctcggctgt 42660 
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gaccgggtgg aagagctcac cctggaaaca 
gccggtagtc gtggccctgc gggagggacc 
gtgcggacga acgacgccat cgaaatccag 
cggcgaaggg tgtcgctgta ttcccgcccg 
cgccacgcca ccggcgaact cgtcgtcggc 
tcggctgagg gtgccgagtc gattgctctc 
gggttcgagt acgggccgtt gttccagggg 
gttctcgccg aaatcgcccc gccggccgag 
gacccagcgt tgctggatgc cgcgctgcag 
ggcggcgcgt ggctgccgtt ctcattcacc 
atcagcaggg tgcggctgga gaccaggcga 
gagagtgggc ggttgctcgc ctcgatcgat 
cagctggcga atcgggacgc tgtccgcgac 
gcgacgcagt cgacggaacc gggtcgctgg 
gacgatctca tcaaactcgc aacggattcc 
gccgagaaac ttgattccag cgcgctggtt 
caggcggatc ccggcaccgg cgcagccgca 
ctccaagcgt ggttggctga gccgcggttg 
gcagcggtga cgacggctcc gagtgacggt 
gggttgttgc gtgccgcgca ggtggagaac 
ggaaccgccg aatcgtggcg tgcgttgccg 
gccctgcgga agggcgcggt gcgagcgccc 
gtgcccgcgg ttgtggcgga tcccgaccga 
ttggggggcg cggttgcccg ccacctggtg 
acgggccgtc gtggctggga tgctcctgga 
ctcggtgccg tggtcgacgt ggtggcgtgc 
ttgctggcgg cggtcccggc ggaatttccg 
ctggccgacg gggtgatcga gtcgttgtca 
aaggcggcgg gggcgtggaa tctgcacgag 
gcgttgttct cctcgctttc cggtgttgcc 
gcgaacgcgt tcctggacgc attggcgcat 
tcgctggcct ggggcctgtg ggagcagccg 
gaccggagca ggatcgcgcg cgccaacccg 
ttcgatgccg ggctggcgct ggaccgggca 
ttcctggccg agcaggcgcg gtcgggctcg 
cccatccgtc gtaataggcg ggctagcgga 
ggggtggtgc gggagcatgc cgcggccgtg 
gtcgagcgcg ctttccggga tctgggtttt 
cgccttgccg gggtgctggg ggtgcggttg 
ccgagggcgc tggcccggtt cctgcaccag 
gcgccggtga cgacgaccag ggcaccggtc 
atgggatgcc gttttcccgg tcaggtgtcc 
gggggcgtgg atgcggtcgc ggacttccca 
ttcgatccgg acccggaacg ggctgggaag 
gacgccgatc ggttcgatgc gggtttcttc 
gatccgcagc aacggctgtt gctggagctg 
gatccgggtt cgctgagggg gagtcggacc 
tatggcgccc ggttcgccag ccgagccccg 
agtgctggga gtgtcgcgtc gggccggatt 
gtgacggtgg atactgcgtg ttcgtcgtcg 
ttgcgttccg gcgaatgcga tctcgccctt 
gggacgtttg tggaattctc ccgtcagcgg 
ttcgcggaga gcgcggacgg taccggttgg 
cggttgtcgg atgctcggcg gaatgggcat 
gtgaatcagg atggtgcgtc gaatggcttg 
gtcatccagc aggcgttggc gagtgcgggt 
gcgcatggga ccgggaccag gttgggtgat 
tatgggcgcg atcgtgatcc cggtcggccg 
ggtcatacgc aggcggcggc gggtgttgcc 
cacgggcaac ttccgcgcac gctgcacgtg 
gcggggaggg tccagctcct gacggagaac 
cgggtggggg tgtcgtcgtt cgggatcagc 
tccacggggc agatggatca ggcagcggag 
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ccgctggtgg tccccgaacg cgcggcgggt 42720 

acagtttcaa ttgaaactgc ggaagaacgt 42780 

ctgctggtga acgcacccga cgaaggcggt 42840 

gccggtgggt cgagaggtgg gggttggacg 42900 

accaccggtg gtagggcggt tcctgattgg 42960 

gatgagttct acgtcgctct ggccggaaac 43020 

cttcaggcgg catggcgtcg tggtgacgag 43080 

gccgatgcga tggcgtcggg atacctgctc 43140 

gcgtccgcgc tcggcgaccg cccggagcaa 43200 

ggcgtcgaac tttccgctcc ggcagggacg 43260 

cccgacgcga tatcggtggc cgtgatggat 43320 

tctctcaggc tacgaagcgt gtcgtcggga 43380 

gcgctgttcg aggtgacctg ggagccggtg 43440 

gccctgcttg gtgatactgc ctgcggtaaa 43500 

gccgaccgct gcgcggatct ggcggcgcta 43560 

cctgatgtcg tggtctactg cgccggagaa 43620 

cttgcggaga cccagcagac gttggctctg 43680 

gccgaggcac gtctggtggt ggtgacgtgt 43740 

gcatcagagc tggcacatgc gccgttgtgg 43800 

ccggggcagt ttgtgctggc ggacgtcgac 43860 

agtgcgttgg gctcgatgga accgcagttg 43920 

cgcttggctt cggtcgccgg gcagatcgac 43980 

accgtgctga tttcgggcgg cacgggcctg 44040 

accgaacgcg gtgtccgccg attggtgttg 44100 

atcaccgagt tggtgggtga gctgaacggc 44160 

gacgtcgcgg atcgtgctga tctggagtcg 44220 

ttgtgcggcg tggtgcatgc cgcgggggcg 44280 

ccggacgacg tgggagcggt gttcggcccg 44340 

ctgactcgtg atacggacct gtcgttcttc 44400 

ggcgctcctg gtcagggcaa ttatgcggcg 44460 

taccggcggt cacagggact gcctgcggtg 44520 

agcgggatga cggagacgct cagcgaggtc 44580 

ccgttgtcca ccaaggaggg attgcggctg 44640 

gcggtagttc cggcgaagtt ggacaggact 44700 

ctgcccgcat tgttgacggc actggtaccc 44760 

accgagctcg cggacgaggg caccctgctc 44820 

ctggggtatt cgagcgcggc tgacgtcggg 44880 

gattcgttgt ctggtgtgga gttgcggaac 44940 

ccggcgactg cggtgttcga ctatccgacg 45000 

gaactggcag acgagatcgc tacgacgcca 45060 

gccgaagacg atctcgtcgc gatagtcggg 45120 

tcgccggagg agctctggcg tttggtggcc 45180 

gccgatcgcg gctgggatct ggcaggcttg 45240 

acctacgtgc gggaaggggc cttcctcacc 45300 

gggatttccc cgcgtgaggc gttggcgatg 45360 

tcctgggagg ccattgaacg ggcagggatc 45420 

ggtgtgttcg cggggctgat gtaccacgac 45480 

gaaggtttcg aggggtatct cggcaatggc 45540 

gcgtactcgt ttggtttcga gggtcctgcg 45600 

ttggtggcgt tgcatttggc gggtcagtcg 45660 

gccggtggtg tgacggtgat gtcgacgccc 45720 

ggcctggcac cggacgggcg gtgcaagtcg 45780 

ggtgagggtg ctggtttggt gttgttggag 45840 

cgggtgttgg cggtggttcg tgggtcggcg 45900 

accgcgccga atggtccctc gcagcagcgg 45960 

ctgtcggtgt ccgatgtgga tgccgtggag 46020 

ccgattgagg cgcaggctct gattgctacg 46080 

ttgtggttgg ggtcggtgaa gtccaacatc 46140 

ggtgtgatca agatggtgat ggcgatgcgg 46200 

gatgcaccct cctcgcaggt ggattggtcg 46260 

acgccctggc ccgacagtgg tcgcccctgt 46320 

ggcaccaacg cgcacgtcat cctggaacag 46380 

ccggattcga gtcctgttct ggatgttccg 46440 
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gtggtgccgt gggtggtgtc gggcaaaaca 
ttggcgacct atttggacca aaatgttgat 
gcggtgaccc gttcggcgct ggatgagcgg 
ttgttgtctg gcctgaatgc gctggctgcc 
cctgtcggga ttggtggccg gaccgggttt 
gggatgggcc gccggttgta ctcggagttt 
tgcgccgagc tcgatgcgaa cctggggagg 
tccgacgagt ccttgctgga tcggactttg 
gtcggtctct gggaattgtt gggtacgtgg 
tcggtcgggg agctagccgc ggcgttcgcc 
cggctggtgg cgggtcgtgc gcggttgatg 
gcggtgtccg cgaccgaggc ccgagtcggc 
ggtgtcgcag cggttaacgc tccggggtcg 
gatggcattg ccggtcggct ggacgggcaa 
cacgcgtttc attcgcatcg gatggatccg 
agcgtggact accggtctcc acggctgccg 
gacgtgggcg tgatggctac gccggagtat 
ttcgccgacg gtgtccaggc gcttgtggac 
ccggacgggg cgttgtcgag cttggttcaa 
gggattccgt tggtgcggag agaccgtgat 
cagacccaca ctcgtggtgg cgcggtggac 
acgcaagtcg accttcccac gtatgccttc 
gattccggtg atgtgaccgg tgttggcctg 
gtggtgccgg tcgcgggcgg cgatgaggtg 
catccgtggc tggcggaaca ccgcgtgctg 
ctggagatgg cgtggcgggc cggtagccag 
ttggaggcac cgctggtcct gccggagcgg 
gctccggatg aggccggccg gcgcagtttg 
ggcgactggc ggcggattgc ctccgggctg 
gattcgacgg catggccgcc ggacggcgcc 
cgcctcgccg agcgcggctt gacctacggt 
cggcacggcg acgatatctt cgccgaattg 
atccacccgg cgctgctgga cgctgcactg 
tcggaagcgc gtctgccgtt ttcctggcgt 
gcgcttcggg tacggctctc gccgctgggc 
gccacagggc gacgagtcgc tgcggtggaa 
cagatcggtg ccggtcgcgg cgatcaagag 
gctgaatctg cggggatgtc tctgacctcc 
tggcacgctg cgctgaagac cactggtgtc 
ttggccaccg aggttgccaa gcggggttca 
cgaccccgag cgatgcagga gctgccgacc 
gcgatgctgc agcaatggct tgccgatgac 
acgcatcggg cggtctccgc agttgctgga 
ctgtggggct tggtccgcag cgcgcaagcg 
atggacgacg agcgagcatc gcagacggca 
cagctcgcgg tgcggtcggg agttgtgctg 
ggaggtgaag cgttcaggtg ggatgaaggc 
ctcggggccc tgctcgcacg ccatctggtc 
gcaagtcgcc gtggtctggc ggcgcccgga 
gccggcgccg acgtcgcggt cgtcgcgtgc 
ctggtggcgt cggtgcctgc ggaaaacccg 
ctggatgacg gtgtgctgat gtcgatgtcg 
aaagtggatg ccgcgtggta cctgcacgag 
gtgttgttct cctcggtcgc gggcctgttc 
ggcaacgctt tcctggatgc cttggcgcat 
tcgctggcct ccgggctgtg ggcgagtatc 
gatgtggagc ggctgtcgcg ggcaggcatt 
ttgttcgacg ctgccgttgg ctcggacgaa 
gaagcactgc gtgtgcaggc ccgatccgtg 
atggcaatgg ggccaagccg 
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cccgaagcgc tatccgccca ggcggcaacg 46500 
gtctcccctc tggacgttgg gatttcgctt 46560 
gcggtggtgc tggggtcgga tcgtgacacg 46620 
ggtcatgagg ctgctggcgt ggttacggga 46680 
gtgttcgccg gtcaaggcgg tcagtggttg 46740 
ccggcgttcg ccggtgcttt cgacgaagca 46800 
gaagtcgggg ttcgggatgt ggtgttcggc 46860 
tgggcgcagt cgggtttgtt cgcgttgcag 46920 
ggtgttcggc ccagcgtagt gctggggcat 46980 
gcaggtgtgc tgtcgatggc ggaggcggct 47040 
caggcgttgc cttctggcgg tgccatgctg 47100 
ccgctgctcg atggggtgcg ggatcgtgtt 47160 
gtggtgcttt ccggtgaccg ggatgtgctc 47220 
ggtatccggt cgaggtggtt gcgggtttcg 47280 
atgctggcgg agttcgccga gctcgcacgg 47340 
attgtctcga cgctgaccgg aaacctcgat 47400 
tgggtgcgcc aggtgcgaga gcccgtccgc 47460 
caaggcgtcg acacgattgt ggaactcggt 47520 
gagtgtgtgg cggagtccgg gcgggcgacg 47580 
gaggtccgaa cggtgctgga cgctttggcg 47640 
tgggggtcat ttttcgctgg tacgagggca 47700 
caacgacagc ggtactggct ggagccatcg 47760 
accggggcgg agcatccgct gttgggtgcc 47820 
ctgctgaccg gcaggctgtc ggtggggacg 47880 
ggcgaagtcg tcgtccccgg caccgcgttg 47940 
gtcggttgtg aacgtgtgga ggagctcacc 48000 
ggcgctgcgg cggtgcagtt ggcggtgggg 48060 
cagctctatt cccgaggcgc tgatgaagac 48120 
ttggcccagg ccaatgcggt gccgccggcg 48180 
gggcaggtcg atctggcgga gttctacgag 48240 
ccggtattcc aagggctccg cgccgcatgg 48300 
gccgggtcac cagacgcctc gggtttcggc 48360 
cacgcgatgg cgcttggtgc ttcgcccgac 48420 
ggcgcccagc tgtaccgcgc tgaaggagca 48480 
tccggtgcag tctcattgac gttggtggat 48540 
tcgctttcga cgcgaccggt ctccaccgac 48600 
cggctgctgc acgtcgagtg ggtaaggtcg 48660 
tgcgcggtgg tcggtttggg cgaaccggag 48720 
caagtcgagt cccatgcgga ccttgcttcg 48780 
gctcctggtg cggtcatcgt cccgtgcccg 48840 
gccgcgcgaa gggcgacgca acaggcgatg 48900 
cggttcgtca gtacgcgcct gatcctgctg 48960 
gaagacgtgc tcgacctggt acacgcgccg 49020 
gagcacccgg accgattcgc cttgatcgat 49080 
ctcgccgaag cgctgactgc gggagaagcg 49140 
gcgccccgcc tcggccaggt gaaggtgagt 49200 
accgtgctgg tcaccggcgg aaccggcggg 49260 
agcgcccacg gtgtgcggca cctgttgctc 49320 
gcggatgagc tggtggccga gctggagcag 49380 
gactcggcag atcgggactc gcttgcgcgg 49440 
ttgcgggtgg tggtgcacgc cgccggtgtg 49500 
ccggagcgct tggacgcggt gttgcggccc 49560 
ctgactcggg aactcggtct gtcggcgttc 49620 
ggcggtgcgg ggcagagcaa ttacgctgcc 49680 
tgccggcagg cccaggggct gcccgcgctg 49740 
gatggaatgg cgggcgacct cgctgcggca 49800 
ggcccgcttt cggcaccggg agggctggcc 49860 
ccgttgctgg caccggtgcg actggatgtc 49920 
cagacccgga ttccggaaat gctgcatggc 49980 

50000 



<210> 3 
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<211> 25360 
<212> DNA 

<213> Saccharopolyspora spinosa 
<400> 3 

ccgcactccg ttcacttcca gggttgagcc 
gggcgaacgt cggcagcaag tgctccagcg 
gcacggcagg tcgagcgatg tggacatcga 
gctgacggcc atcgaactcc gcaaccgtct 
gacgctggcc ttcgaccacg gcactgcggc 
aggcaccgcg accgcgccgg caccgaggcg 
gaggtcgctc ttccaacagg cgtatgcggc 
gaaggtcgct gcccagttgc gaccggtgtt 
gaaacccgtc cagctttccc gtggtcccga 
gatcgggatg ccgcccgcac agcagtacgc 
ggacgtttcg gtgatcccga tgcctggatt 
cgaggtggcg gttcggacgc aggcggaggc 
cgtactggtc gggcattcct ccgggggctg 
gcgtcgcggg gtcgtcccgg ccggggtcgt 
cacgccgagg ttctccgtgg cgatggccca 
ggacatgcag gatgtcggta tcaccgcgat 
gactccgacg ccgatcggtg ctccgacgct 
ccctgaaggg cggccgtgga cagatgactc 
cacggtccag gtgccgggcg accacttctc 
acaggcagtc gcgagttggc ttgacaaact 
ggcgtccttt taggaccttc tgggcggcac 
tccaggctcg ccgatcttga cggcgcacga 
ctgccgctcg tggccatcgg cctggcgaat 
cgtccgagcc cgcattccga ggtgaggggc 
cccctcacct cgaaattcgt ccgatttggg 
gtctttcttg ttgacagagc ggtgagaagc 
atgatgctca agcgccaccg tttgacgacc 
ctggtcagcg gctgcggaac cgccgccgca 
gcgcgcaatg ttggtatggc ctcgggcggg 
tcggaggccg atttcctcgc caccgcgaca 
gtggcgtacg ggaaccggtc ggacaagacc 
aagggcgtgg acatgagcaa ctcgccgatc 
cccgacgcgc ccaaggaatt caccctccag 
atggtcctgg ccgacagcgg cgacccgaac 
ctgccggaca tgtccgaggc ccagccggtc 
gccgcgaagt acctgcgagt cagctcgctg 
ccatgtgagt cccgcagatt cgacctcgcc 
tttgtcgttg tgaccagcgt tgttcacgtc 
ctcgggcgcc tccaacgggg cctggcatcc 
ttctctgtcg ggcgttgtca cgccgccggc 
tggtccagcg cggtgtggcg gcggccggag 
tttttcgaac cgacgaggac cacgaccttc 
ttgcgctgca gcatcatcca ggcgcttgtg 
cttccctgcg aacgatggat ggtcaccgcg 
aagtcgtagt cgatgtcctc gtcctcgtcg 
tcgagggcgg acacgacgcc ctgcgtgccg 
ttgttgcgga tctgcgtgac cttgtcgccg 
tctggcaggc cctccctggc cggggtgatc 
gcacctgcgg ggcctcgatg catcggggcg 
cggaacttcc gcggaatccg gcgggcgacg 
ctttcctcta cgtggaacag gaagaagtcg 
gcgttgattc ggtgcgcgtt ggtcaccacc 
ttgagccgca cgtgtggaat cggggtgcca 
gctccgaccg acgggagctg gtcgacgtcg 
atcgccttgg ccagtttgtt ggctaacagc 
acgaggtcgg cgtccagcgg gttgtcccgg 
tggagcaggc ggtgcacggt cgccgcgtcg 
gctcgtcccg tcggcgcggc gaggatcacc 
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gttgcacgaa cggctggccg gattgtcgga 60 
cgtccgcgcc gatatcgcgg tggtactggg 120 
gaagcctttg gccgagctgg gtttcgactc 180 
cgctaccgcc accggactgc ggcttcccgc 240 
ggcactcgcc cagcacgtgt gcgcgcagct 300 
aaccgacgac aacgacgcca cggagcccgt 360 
tggccggata cttgacggga tggatttggt 420 
cggttcgcct ggcgagctgg aatccctgcc 480 
agagcttgcc ttggtgtgca tgccggcgct 540 
gcggatcgcc gccgggttcc gcgatgtgcg 600 
cattgcggga gaaccgctgc cgtccgccat 660 
ggtgctgcag gaattcgccg ggggctcgtt 720 
gctggcgcac gaggtagccg gtgagctgga 780 
actgctggac acctacatcc ccggtgagat 840 
ccggacgtat gagaagctcg cgactttcac 900 
gggcgggtac ttccggatgt tcaccgagtg 960 
gttcgtgcgg accgaagatt gcgtcgcaga 1020 
ctggcggcca gggtggactc tcgcggatgc 1080 
gatgatggac gagcacgccg ggtccaccgc 1140 
caaccagcgc accgctcggc aacgctgacg 1200 
cggccacccc ggcggtgccg ccttccgtgg 1260 
tgcgcggcac gcgcgctgat cgtgattccg 1320 
catgtccttt cgggcaacgt caaacgaatt 1380 
acccttgggt ggctgagccg ctcaagggtg 1440 
cggtggacgc aaccccggtg ggcgtggtgc 1500 
cgctgacaca cctgagagga aaaggggagc 1560 
gccatcaccg gccttctggg gggagtactg 1620 
cttcagtcct cgccggcgcc cgggcatgac 1680 
ggcggcgggg acatcggcac gtcgaactgc 1740 
ccggtgaaag gcgaccccgg cagtttcatc 1800 
tgcacgatca acggcggcgt gccgaacctc 1860 
gaggacctgc cggtcgagga cgtgcggctt 1920 
cccggtcaga gcgcgtacgc cggcattggc 1980 
gcccatgtcc tcaccgggtt ccagtcctcg 2040 
aacgttctcg gcgacggcaa cgtgaagttc 2100 
gtgtctaccg cagacgagct gcgctaaaac 2160 
gtgcggcgcc tccggcgaag cgtccgtacg 2220 
cgggcgcagc gctggtacat actcaggcgt 2280 
ggggccgtcg agtgcggcgg cgctgacgcg 2340 
ctcgaaccgg tcccgccccg tcggagccgg 2400 
ccgacggtgc gcaccgcctg cccgagggcc 2460 
ttggcccggg tgaccgccgt gtagagcagg 2520 
gtcaagggga tcaccacgca cgggtattcg 2580 
taggcgtgga ccagttcgtc gagttctgtg 2640 
gttcgcacgg tcatggtctg tgcttcgttg 2700 
ttgaacacgc cgttggcgcc cttgtcgtag 2760 
acgcggaaga tccgtccgcc gaaccgccgc 2820 
gcttcctgca acagctggtt cagcgcgcct 2880 
aggacctgca cgtcggtgcg cgggttgaac 2940 
acgtcgacgg tgagctcggc ggtcggttcg 3000 
gtcagcccgt gtgtcagcgg atagtccccg 3060 
ccggactcgg cggcctgccg gaacacctcg 3120 
ggggcgagca gatcgcgcag tacctcaccg 3180 
ccgaccagca gcaggtgcgc gccgggcgcg 3240 
aggtcgagca tggacgcctc gtcgaccacg 3300 
tcgtaggcgg cgtccccgcc cggctggagt 3360 
tgtccggtga gctcggtcag ccgcttcgcc 3420 
ttggcctttt tcgcctgagc taatgcgatg 3480 
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atcgaccgca cggtgaagct cttgccgcag 
tcggtcaggg ccagcttgac ggcgcgctcc 
cggcgcaacc agtcgagggc cttgtgccaa 
gcgctggtgt tcagcagccg ggacagctgg 
ggcacgaggt agatcgcgac cgtcggcacc 
accacacctt cctcggtgac gagttcggcg 
gcgaggatct tcaccgcctc ggcgatcagc 
ccggtggact ccgacagcgt gaactgaagg 
gggattccca ccgctttggc gatggtgtcg 
cctgccagcc ggtatggctc ttccttgacg 
tagatcttca ccgccagcga ggtcgagacg 
ttgatcgcct tctgctcctc ccacgcgtcg 
ttggggacct cgatcagccg cgcgggttcc 
aagtggtcga cgatcttctc ggcgagtttg 
aggtagcggc ggataccttg cacggtcgca 
aactgccgcc cgtactgggg gtgcgacccc 
ggctgcgcgc ccagcagcgc gccgacgacc 
atccgcgcga cggtgtagcc gctctcctcg 
tccagcaccg cagtccacgt ggccgactcc 
gaacggctgt caagcaaacc ggcggtcacc 
gctccggcgt cagcgggtgg gcgttcgcga 
ttttttcgtc ctcgcgcagg gcctggagta 
gccgggataa accggcttga cggaggagat 
ccacgacgac cccgagtcat gcgtcgacga 
gggcacgtgg gggcgtgcga tcaggtcgcg 
cagcagccgc gacggactgg ttccgctcct 
agctcgatgc cggccgcgtt cgcgaggtgg 
agtctgcctt ttgaacatct ggtcaattca 
ctgcccgcca attgtttgct cgccgtgcgc 
gcgtgctggc gaaggtgtcg attggctcgg 
tcgccacgag cgtcggcgat gcccggtact 
tccacgggcg cggtgcgctc ctgatcatga 
gccgtcgtgg ggcactgtga ggatgtgcat 
cctgatcgcg ggctgccgcg gctgtgccca 
gagcgtcagc ccctgcgcgt cgagtgcgac 
ctgctcggtt gccggtgcgt tcgcctgcac 
cacatcggtg acgaccgagc cattcgcggt 
caggaactcc gcctgctgcg gtccgttcgc 
ccagcaggcg tggtcgttgt tgttcatcga 
caggaacgtc agaccggtgc ggcccagcga 
gaggtagaac tgcggcctgc gtaggaacag 
ggtgaagtgc gcctgttggt atggaagttt 
cggatagcgc tcgccgtaga tcgcgtgggt 
gagtgcgggc accggtgccg ggtcctcggc 
gcggctggac aggaacgcgc ccaagttcgg 
ggtacgttcc gggtccggtc gcacgtcggc 
ggcgacgttg gcgaaccagc cggagccgtc 
gtcggtgtac ttccgcgcca tcgtgacgag 
ggccatctcc ggcagcatca cctcgaagtt 
gaagtagccc gaaggactct gcccgttcgc 
ctgccgcagc gcggggtcgg ggtcctggtt 
ggtaacctgg ttcgcgtagt ggattgtggg 
gagccacgtc atcgctttgc gcagcgccga 
gttcgccgcg cgcagcaggg cgttggtctt 
ggcgagcccg tgctcggtcg gcgaatactc 
taggtagtgt ccgagcccgg cgtcgagcgc 
gttccacgtg cgggattgcg tggcgaacca 
ggcgttgtag gacaccgccg ggctgcgcca 
catgtcggcg accatcggcg agaccgcggt 
tgcgaacagg cgccggttcg gtgtggtggg 
cggcagtgcg gcgagcagtc cgagagcggc 
cgtagtcacg ggcctacctc cttgtggccg 
tcaagatgat aattcgaatt attatgggct 



LeA33955-US 
cctggacctc cggtgagcac ggcgaccttc 3540 
tgcgcctcgg cgagttcggc accggtagcg 3600 
tcgacgtcgg cgaagacggg catccggtcc 3660 
ttggccaggg cgacttcggc gcggtggaag 3720 
tcgtcgtcat cggtggggat ctcctcgcgg 3780 
aggcattcga tcaccagccc ggtgtcgacg 3840 
tcgttctccg gcaggtagca gttgccgtcg 3900 
cccgccttta cccgctgcgg ggagtcgtgc 3960 
gcggtcttga aaccgattcc ccacacgtcg 4020 
gtccggatcg cgtcgtcgtg gtactgcttg 4080 
ccgacgcctt gcaggaagat catcacctcc 4140 
gcgatcagct tcgtccgctt cgggccgagc 4200 
tgctcgatga cgtcgagcgc ggcgacgccg 4260 
gggccgatgc ccttgatcag gccagacccc 4320 
ggcagcacgg tcgtgtagtc gtcgacgtgg 4380 
caccggccgc gcatgcgcaa cgcctcgccg 4440 
gtcaccaggt caccgccccg gccggtgtcg 4500 
ttggcgaacg tgatccgctc cagcgtgccc 4560 
cgtccttttt ccaccgacaa cacgtatcac 4620 
acatgcagcg gcatctcccg aacgcctcgg 4680 
tgccttggtg cggccggtgg gagttgtaga 4740 
ggtgccgctg gctccagatc aggggaaagc 4800 
ggaagagctc tacgtcgaat ggccagcgac 4860 
cctgcgaggc cgtggcgaag cgttgacagg 4920 
gcgccggctc agcgccgtgt ccggacgcac 4980 
cgagcgcggt gtcctcgatc gtctttttcg 5040 
ccagtccagt cggtcggcag tcgtggtcgc 5100 
gctttgaatg atcgttcggc gagtattgtc 5160 
cctgggacag gagcgcggcg ttcggggacg 5220 
acggccggac caggatcgcc gacgtcgtga 5280 
atctggagca ccagcgagtg gacaacgact 5340 
atcaagaact gagaccgtca gggtgaacgt 5400 
cacacggccg gggtaggtga tcgaggaggc 5460 
gtcgaatcgc agcaccacct ggccgcgggt 5520 
tttgtcggag tccctcacca ccaacgggat 5580 
cgacctggtg atccggtcgg gccgcacggt 5640 
gcggtagcgg aaccggtatg gctcgccgcc 5700 
gtccggcacc tggcccggga agaccgtcgc 5760 
ctggacgacc gtgcccgcga ccgggtgcca 5820 
ggtcgcgcgc ctcgtgccga agtagccgcc 5880 
gaagtcttgc ccctggtcgc tgcggatctc 5940 
cgcgatcgcg gccgtgcgct gggcgcgggt 6000 
gaggatgcgc ggtgaggtgt cctgtttgac 6060 
ccaggccgct cgtgcggcgg cgaggtcttc 6120 
caccactggc acgaactggt tgttcagcgc 6180 
gtagaaccgc gccgaggtgc gggagttggg 6240 
gggttcgcgc agcaggttgt agctgagcca 6300 
gttggcgttg ttggactggc gccacgcgtc 6360 
gtagttggtg tccgcgccgg tcaactcgta 6420 
ggcaaggcgc gcgaacgcct cggtcaactg 6480 
cagcgcgagc gaggagccgg cgagcccggc 6540 
ctgccaggcg ctaccgttgc ccgggttgag 6600 
ggtgatctgc gactggcgtt gcggcaggat 6660 
gctcaggtac cccaacccga agccggtggc 6720 
cggccacgag ccgtcatcgt gctgcaaccc 6780 
ggcgagcaga gtcgcgtcgc cccggtacgg 6840 
ggcgagcgtg tagacgtgct cctggacgcg 6900 
ccagcctccg gcgaagaagc cgctcgagtc 6960 
gaggtacgac gcgaatcgct gctcctcggg 7020 
cggcggggcg ggcagtgcga gtgcgcgggt 7080 
ggcgccggtc atcaggctgc gacggctgaa 7140 
atcaaccctc acccgctgcg tagccgcacg 7200 
tgacgacgcg taggccgacg acgcagaatt 7260 
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cctgccaatt cgtattggca agcgggggtg 
atcgcccgga tggcaggagt cagccaggcc 
gacggcaacg tccggatcgc ggcggagacc 
accggctacg tcgcgaaccc gatcgcccgc 
ggcgtgttca cctacgaggc ggtgttcccg 
ctcgaaggca tcgaggaaca ggccgaggag 
gccaaggcca ccggggagcg gcggcggatt 
gacggcagtc tgctgctcgg tcgcacggtc 
gaaagcatcc cgtacgtctc catcggacga 
gtcggggccg actaccgcac cgcggtgcga 
caccgcgggt tcgcgtacgt ggggtctggt 
cgaggcttcc gcgaagccgt tgccgcacat 
cagctcgatc agctgcgcga agcgggcgtc 
ggggccgcgc tcgtgctcgc cgggcgcgaa 
gtgctctcgc tcggtgccgc tacccggtcg 
ttccgcatcc ccaggcgcga gatggggcgc 
gaaaacggca ccacaccgca agaactgctc 
ggcgcaccac gcctttgacc aggaggaact 
cgttggcggc ggactcggcg gtgtcgccgc 
ggtcgtgctc accgaggagt acgactggct 
gcccgacgag cacagctggg tggagcgctt 
cgacggcatc cgcgactact accgccgcca 
gcgggagctc aaccccggtg cgggcaacgt 
cgtcgcggtg atcgacgaga tgctggcgcc 
gcagccgtac cggccggtgg ccgcgcacaa 
tgcgcaccgc gacaccggtg aacagatcga 
ggagacgggt gaactgcttc cgttgtccag 
tctggacacc ggcgagccga gtgcgcccga 
gtcggtgtgc ttcgtggtcg accacgtcga 
gtacgacttc tggcgcgcgt accagccgga 
ctcccccaac ccgcgcacgc tcgcgatctc 
cgacccgctc ggcgtcgtgt cggaccagcg 
gttccggcgc atcgccgcgc gtcgcaactt 
cctggtgaac tggccgatca tcgactactt 
cgacgcgcac atcgccgcgg cgcgggaact 
cgaggcgccg cgcccagacg gcggcaccgg 
caccggcagc gcggacggtc tcgcgcaggc 
ggccgagcac acgatcgtcg aacaggacct 
ggtgcagcac gccgacgccg tgggtgtcgg 
cggtggcgac aactacatcg acgtcgcgag 
gatcccgcaa cgggtggaga acctgctacc 
caccaacggt tcccaccggc tgcacccagt 
gctcgctgcc ttctgcctgg cgcaccgagt 
cctgctcgcg gacttccagc agtgtctgga 
cgtgtccggc tactgacgca gggagacgaa 
actcatcgtc gcagggctgt tcgccctcac 
cggaccgaag tcgctgcgca tgaccgtgtg 
caacgacatc gccgccgagt acaaggcctc 
ctcggtgccc gccgacggct acaccaccac 
gccggacctg gcctggatcc tggaggagtc 
cgccccggtg cgcggcaaga tcgagaaggc 
gtgggagaag gacggcgaac tgtacgccta 
cgtcaacacc gacctggtga agggcgcctc 
ggctgcctct gcgtcggcgg ccgcctccgg 
gtaccagaac tgggcagtgc tgtcctctat 
cgcggacggt cgctcgtgcg ggttctccag 
gcacaaggcc atcttcaccg acaaggcgat 
cgccggcgac gcggcgatgg cgatcggcca 
gaagttcggc tggacgctgc tgccgctgcc 
cgggcaggcc gggatcggtg tgctgaagca 
cctcgccttc ctcaccaacc agaccaactc 
gcggtcgtcg ctgctcaacg cggagacgct 
gcaactgcag tccgtcgtcg tcgacggcat 
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ctcgtggccc gacggctcac ccagcaggac 7320 
acggtgtcgc tggtgctcaa caaccggaag 7380 
cgtgcgcagg tactggaggt gattcggaag 7440 
aggatgcgcg atcggcacaa ccgcatcctc 7500 
agcacccacg cgaacttcta ccagtcgttc 7560 
gtcggctgtg acctgttgtt gttcaccagc 7620 
ttcggcgacg acagccgggt gcggctcgcc 7680 
gaccgcgacg acctgaccca gctgctcgcc 7740 
cgcgacgacg cgggcggtcc ggtgccgcac 7800 
gacctggtgg accgcgcggt cgcgctcggc 7860 
gggggcgcgg agtcgtccgc ggatcggctg 7920 
ggcgtccaag ggatgcatgt ggagacccca 7980 
accgctgtgc tcaccgaaga ggtgtcggac 8040 
cgcgggctct ccgtgccggg cgacctcgcc 8100 
gcaccggacg acgacgtggc gctcaccggt 8160 
cgggcggtgc aggcgctgac cgaggtgctc 8220 
ccgtgcgagt tcgtcgaggg ctcgacgctg 8280 
gttgctcgac cacaccacgg acgttgtcgt 8340 
cgcactcgcg ttgctgcgcg cgggccgccg 8400 
cggcggccag ctgaccagcc aggccgtgcc 8460 
cggcgtcacc gcgagctacc gggcgttacg 8520 
ctacccgctg accccgcgcg cacgggcgtg 8580 
gagcaggctc tgccacgagc cccgcgtcgc 8640 
gttccgcggc agtggcaggc tgaccgtgct 8700 
cgacggcgac cggatcgtgt cggtgaccgt 8760 
gctctccgcg ccgtacatcc tggacgcgac 8820 
cacggagtac gtcaccggct tcgagtccac 8880 
cgtcgcgcag ccggcgaaca tgcaggcggt 8940 
cggcgaccac accatcgaca aaccggcgcg 9000 
cttctggggc gaccggatgc tgtcgttccg 9060 
cgaacgtacg ttcaccccga acccggacga 9120 
gctcagtgcc ggtgacagca atctgtggac 9180 
cgtcgagggt gcctacgaca gcgacatctg 9240 
cgagtcgccg gtgatcgacg tgccggacgc 9300 
ctcacgttcg gtgctctact ggctacagac 9360 
cttccccggc ctccgcctgc gcggcgacgt 9420 
gccgtacatc cgcgagtcca ggcgcatcag 9480 
ctcgctcgcc gtgcgcggcg acaagggtgc 9540 
catgtaccgt atcgacctgc acccctccac 9600 
ctgcccgttc gagatcccgc tcggcgcgct 9660 
cgcgggcaag aacatcggca ccacccacat 9720 
cgagtggaac gtcggcgagg tcgcgggcgc 9780 
cacccctcgc gcggtgcgca atacccctgg 9840 
acgcgacggg gtcgagctcc gctggccgga 9900 
aatgacaaag ctgtcacgac gactcacggc 9960 
cggctgcggt ggtggatcaa ccgcacagtc 10020 
gactgccaac gcggcgcatc tcaagctgct 10080 
gcacccggac atcgccgaga tcaagttcga 10140 
gctcaccacc cagatcgccg gcggtaacgc 10200 
ggcaccggac ttcgtggcgt ccggtgcgct 10260 
cgacgagctc gtgccgtccg cgacgaagct 10320 
cccgttctcc acctcgccgt tcggcttgtt 10380 
ggcggactgg acctgggacc aggcgatcgc 10440 
caagggcggc ctggtactgc cggacttcaa 10500 
ctggcgcggc tggggagctg atgcgtggag 10560 
cagcgagatg aacgacgcga tgtccttctt 10620 
tccgggcccc ggcacgacgg tggacttctt 10680 
gatctcccgg tccagtgcgt tgaaggacgc 10740 
ggccggtccg aagggtgact acgcggtgat 10800 
gtcgcacaac gtcgacgccg cgacggactt 10860 
cgccaagctc gcgcagttct tccccccggc 10920 
cgccaagagc aacccgttga tcaaggccga 10980 
caacaagggc gtcgtgaagc cgagccacaa 11040 
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gggtcaggag gagctgaacc agacgatccg 
cgcgaacgtg cagaacgtgc tgaacgacgt 
caagtgacgg cggtcgcaca ctccacgcgg 
acgcagcgca ggcgcgacaa tctggttggt 
agcatcgcgt tcgtgctggt gccgctggcc 
aacgtgctcg ccggcacgtt cgagttcgtc 
gacgagaagc tgcgcgactc gctggtggcg 
ctcaacctgt cactagcgct gctgctggcc 
acggtcttcc gcacgttgtt cttctctccc 
gtgtggcagc tgatactgca gccggagggc 
gccgacgggc cgaactggct gcgcggtgag 
caggtgctca agaacgtcgg cctgaacatg 
ccgcagccgt tgtacgaggc ggcgaagatc 
cgcatcacct tgccgttgat cagcccgacg 
ggctcgctgc aggtgttcgc gcagatcgcg 
acgaccgtac tgatctacta cctgtaccag 
ggcgcgacga tctcggtact gctgttcgtc 
cagatgcgcc ggaagtgggt gctgcatgag 
gctgtgcctg ctgtgcgtgc ctttcgtgtt 
gaagcccatc agcgagatct tctccacctc 
ctaccggcag gtgttcgaga tgcagccgtt 
cgcggtgatc gtcacgacag gcacaatggc 
gcgtatccgg ttcccgggcc agaacgtgct 
cccgagcgag gtcaccatcg tgccgctgtt 
cacgcactgg ccgctaatca tcgtgccgat 
catcatgcgc cagttcttca tcgccctgcc 
cggcctcgga cgagcgggca tcttctaccg 
gggcgccgtc gcgatcttca cgttcctgca 
ctatctgtcc acaccggaca tgtacacgtt 
ctacggcggc ccgatgtgga acgtgcagct 
tctcgtcgtg ttcgtgctgg cgcagcggca 
taaggggtga gtcccgcgat cgacccggag 
ccggtgacgc cagcgggaat cgacgcggtc 
gagctgacgc gcggtggcac cgtgacgttc 
cttgtgctgc gcccagcggg tgtgccacgt 
atggtcgcgg gcacccggcg caccgacctg 
ggtgtcgtgc tggtgtcgcc ggagtaccgg 
tcgcaggact gcttccgggt gctggagtgg 
gttgcgggta catcggcggg cggcgggctg 
ctcggcggtc cgccgatctt gccgtcggcg 
gccaacggcg tgctaccgcg ccagtgcagc 
cgccgcagac tgcgttcgca cgcctcgacg 
accttgctga ccaggaaaac ctcgtcgcgg 
tcggcgcccc cgctgccgta catctcggcg 
ccgcgccgca gcgcggcgac ctccgccgcg 
gtgccctgcc ccagcaccgg caattcctgc 
actccgtttc agtcagcgct gctgcggcag 
ggacggcccg gcggccaacc gcccaccggc 
cccggacccg ccgctgccga cctcgatggc 
gacggcgacg tgcagcgagc cctcgcggaa 
tgcgtgcttg gaagcgttcg tcaacgcctc 
cgcaggcgac accccgtgcg ggtcgcggat 
ggcgcgcgcc cgcacgacca gctccgggat 
gccgagcccg agcgtcgcgc gcatctcgtc 
cccggcggtc tccttcgcac cgggatcctc 
cgcgattaag gtcacgtgat gtcccaccgc 
ctcctccgcg cgcgccgtct cggccttcgc 
cttccgcaga ctgtccgcca gctgctcccg 
cgcgcctgcc gccagcagca cgaacgccac 
ctgcatgatc atcaccggtg ccacggcgac 
catctggatc cgcggctgcg cgcgccccaa 
acccaaacca ccggccaacg ctggcatgca 
caacggcagc agcgccgaag ccaccaaacc 
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cgccgcgctc gacccgttgt ggaagccgga 11100 
gtgcaccaag atcaaaccgc tgctggagaa 11160 
gccgtagccc gcaaggggac gtcgtactgg 11220 
tacctgttcg tcgcgcccgc gctgctcggc 11280 
ttggtcggct ggtacagcct caacgagtgg 11340 
ggcgcgcaga actaccaaca gctgctcgcc 11400 
accacttggt tcgccgccgg cctggtcgtg 11460 
gtgctgctca accagaagct gagcggcacc 11520 
gtcctcgtgt cgctggtggc gtggacgctg 11580 
agcgtcaacg gactgctcgg gttcttcggt 11640 
tcgaccgcga tggtctcggt gatcgtcgtg 11700 
gtgctgttcc tcgcggcgct gcaaggcgtg 11760 
gacggggcgg gtgcctggac ccggttccgc 11820 
atcctgctca cgtcgatcat caccatcgtc 11880 
gtgctcacgc agggcggtcc gggcacgtcc 11940 
caggcgttcc agttccacca cttcggctac 12000 
atcgtcgccg cactcaccct gctgcagtgg 12060 
gcttagggtc aagatcacgc tatacgggct 12120 
cccaacctgg tggatgatca cctcgtcgat 12180 
gccgctgccg tcggagtgga cgttctccac 12240 
cgcgcagcag tactggaaca gcctctacat 12300 
ggtggcggcg atggccggat acgcgttcgc 12360 
gttcgtggtc gtcctgatcg gcctgctcat 12420 
caagatgttc cagtcgctcg gcctgaccaa 12480 
cttcggggcg ccctgtgtgc tggcgatctt 12540 
gtcggagctg gaggaggccg cgcggatgga 12600 
ggtggccctg ccgctgtccc ggcccgcgct 12660 
ctcgtggaac ctctacctgg agccgatcgt 12720 
gccgcaagcg ctcacgcagt tcgtcgacaa 12780 
cgccgccgcg accatgaccg cagtgccgat 12840 
gttcatcgag ggactcgcgc acaccggggt 12900 
atcgccgcgc tggtagacct cgccagacca 12960 
cgggcgggag gacgcgtcgt caccgatgcc 13020 
gccgatgccg atgccgatgg cgtgcccgtg 13080 
cttccggtgc tgcacctgca cggcggcggg 13140 
cacgtgctgg ccgagtgggt gtcggagctg 13200 
ctcgcacccg agcacccgca cccggcgccg 13260 
atgtcccgca acggcttcgg cccgcctgtc 13320 
gcggctgcgg tgacgctgat ggcccgcgac 13380 
cgcaactgct cgaacgcgtc cagcgtctcc 13440 
aggtagacgt ccaggtgatc ggtgcccaac 13500 
gcgccacgcc gacccgcgtt gtgcggatac 13560 
cgcccggcga tcgcggcacc gacgacctcc 13620 
gtgtcgatca gccccaggcc gaggtccagg 13680 
cgctgcccac gccgctcccc catgccccag 13740 
ccgccacgca gttgaaggcc tcgcatgcac 13800 
cacggccacc agctcgaagc caccgtcctc 13860 
catctgcacc cgctccgaca ggccggccag 13920 
cggtccccgc cggtcggggg cgccgtttgt 13980 
caccagccgc accttgatgt cagcaccggg 14040 
ctggacgacc cggtaagccg cgcgctcaac 14100 
ctccgaggaa agctccacat ccacaccggc 14160 
cgtgctcaac ccggcccgct gcctgggggc 14220 
cagcgcctgc ttcgccaacc cgcgcagccg 14280 
ggtggtggcc gccaacgccg ccgactcgac 14340 
gtcgtggatc tcccgcgcga tccgggcccg 14400 
ctgcaactcc gcctcggtgg ccgcctccag 14460 
caacgccacc agcgccccca atgccgtcgg 14520 
ggtcaggatg atcgacccga cccccagcga 14580 
cgccgcggtc agcaacaccc agccgaccaa 14640 
cgcgtacaac gccacgaccg tcggcgccca 14700 
cagcagcacc gcgatccgcg gccaccgccg 14760 
ggacaccgtc gcatagccga acgggtcggg 14820 
Page 31 



cgcggccagc acagctgccg ccggaatccc 
ccagcgcacc gaactcaccg gaagccgccc 
acccggttct gcgcgccgag cttgctcaac 
tcggtcagcc cgagccggac cccgatatcg 
agcacctgcc gctcccgccc ggacaacccc 
gacagactgc ggaacctggg caacaaccgc 
ccggccgcca gatcgtgcac cgcccgaacc 
aacccgtgcg ccccgaaccg caaggcatcg 
agcaccgccg gcaccggcgg attcgccaac 
tccttgccgg gcatctgcac gtcgatgagg 
gccagcaact catccccgtt gcccgcctcg 
aacaacaccc gcaggccacg ccgcagcatc 
ggccccccgc cgtcgtcagc ccgcacctct 
cagactagcg gggccgcgaa agcgcagcgc 
cgcccgccct tggccttccg ctcccgcacc 
aacccgaacg tctcccggaa cctgcgctcc 
ccggtggtga acagcacgat cgtcggcggc 
ttcggctgct tgccgccccg caccggcggc 
ccgttcaacc ggccggtgga gatccgggtg 
ggggccagtt tcgccaccgc gcgaccggtc 
cgcacccgca ccagctcgcg gtcgatctcc 
accaggtccc acttgttgta ggcgatcacc 
atcacccgca ggtcctgctc gctcaacggc 
tcggccgcct cgatcgcggc cttggtgcgc 
gtcttcacgc gcttgcgcag cccggcggtg 
tccaccagcg agtccaccgg gtccaccgtg 
tcttccccgg tcagcttgtt cagcaggctc 
gccacccgcc gcggcccacc ggtcgcgccg 
tccaggacgg cgtccagcag gtcaccggaa 
ggctcgccca accccagcga ccacagcgaa 
accttgttgg cggccagcag caccggacgc 
tcctcggtct cggtggcacc gacccgggcg 
tgcatggcca gttccgcctg cgcggccacc 
cagccaccgg tgtcgaccac cgtgaagcgg 
cgatcccggg tgacaccagg ggtgtcctgc 
ttcaccagcg tcgacttgcc cacgttggga 
ttcgtcggct cctcgccctc gaccgcttcg 
tcggaccacg tgccgtccag gccttccacc 
aatagtcttg gctctggtcc gcgaccagcc 
cggtgcgcac cgcctcggtc gccacgacca 
cgatcggctc cccgaacaac acgtcgatcc 
ggcggcgcgt gccccggcag gccaccggca 
acgccgcacc gtgctgcgca ttggtcacat 
ccccgacgag accaccggcc cgcagcaccc 
gcgcaccgcg gcgcaccccg atctgcccga 
ggaacatctc ctgcttgatc aggaacaccg 
gcccgtccat catcgaactg tggttggcca 
gctccgcgcc gtggatccgc acccggtacg 
ggccgcaccg gtgcattcca ccggacgcgc 
cgtccgctcc gccgcgacca gcaggccacg 
cacctccacc acacccaggt ccgtggtgtc 
cggcgccacc ttccgccccg agtccagcgc 
gtccaggtca ccggcccgtc cctcggagac 
gtgcgcggac gccgtcaggt agaccttcaa 
gtcgcgtccc tcgacgacca tgccgcctgg 
ctggtgggcg accaactgct cccgcacctc 
ggtcacctcc ggcccgcgga tctcccgccc 
gccctccgga tcggtgccca ccgtcagccg 
cggatccgac gggtcgacct cggcgcgcag 
tccggtgtcg aggtaggtag cccccaatgc 
gccggtgccc gaagggccgt cgagcgccac 
catgtccttc ccgagccgga accgatcggc 
tgcctggtcc gctcctgccg atcccaatcg 
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gacgaccagc gcctcaacgc cgagccgaga 14880 
gccacgcgct gcgcgatcaa cgcggcctgc 14940 
accgtcgaca cgtagctctt gaccgtcgcc 15000 
gcgttggacc gaccatccgc caacaactgg 15060 
tgcaccagct gcgtctcgtg cgcgttgtcg 15120 
gcagtgatcc gcggatccag caccgcccca 15180 
agcaccgcgg gctccgcgtc cttgagcagg 15240 
gccacgtaat cgtccaggtc gaacgtcgtc 15300 
gtcgccaagt cccgcaacgc cgtcagcccg 15360 
gcgacgtcga cgcggtacgc gcgcaccgcg 15420 
gcgaccaccc ggatcgagcc gtcgccctcc 15480 
gcctcgtcat cggcaagcac aacgcggatc 15540 
cggtcacctc cctgttccga gggtcgccag 15600 
acacgccacc cgagcgcact gtgtcagcgg 15660 
cgcaccgaga tccgcaccgg gctacccacg 15720 
aggaaccgcc ggtacccggc ctccaagaac 15780 
cgcgactgcg cctgggtggc gaacaggatc 15840 
gggttggccg cgaccagctc ggaaagccaa 15900 
tcccacgaat ccagcgccgt gcgcagcgtc 15960 
ttcgccgaca cgttgacccg ctcggcccac 16020 
ttctccaact ggtggcggcg gtcctcgtcg 16080 
aatgcccgac cggcctcgac gaccatagtg 16140 
tcggagccgt cgatgagcac gatcaccacc 16200 
agcgacgcgt agtactcggt gccgcttgcg 16260 
tcgacgaacc gccacacctg gccgtccagc 16320 
gtgcccgcga cgtcgtgcac caccgaccgc 16380 
gacttgccca cgttcggctt gccgaccaat 16440 
aagatctcgc gcggcgtctc cgggaacacc 16500 
ccgcgcccgt gcaacgcgct gaccggcatc 16560 
tgcacgtcgg agacgcctcg ctggtcgtcg 16620 
ttggaccgcc gcagcaccct ggccacggct 16680 
tccaccacga gcagcaccgc atcggcggtg 16740 
gacgcctgca gtcccttcgc gtccggctcc 16800 
cgcccgttcc acaacgcgtc gtaggccacc 16860 
acgacggctt cccgacggcc cagcagccgg 16920 
cggccgacca cggccagcac.cggctgcggc 16980 
acctcgtcga actcagccca ctcagcctcg 17040 
gactcgtcgg tcacgctcgt ttccctctcg 17100 
gatccagctc ggcgatcagc tcgaccagtt 17160 
gcccggcgcg gcctttgccc gccggcagca 17220 
tcggtagcag cctacggccc gggccatccg 17280 
ggatcagcgc ctccgacgtc cgcgccaacc 17340 
cgccatcacc ccgggtgccc tccgggaaca 17400 
gcaccgccgc cagcagcggg gcgcggtccg 17460 
tgcggcgcag gaaccagccc agcgcaccgc 17520 
cgtggcgcgg gatcattccg aacagcagcg 17580 
ccagtaccac cgggcccgtg cgcgggatcc 17640 
gcagctgcac gatccggcgc gaaatccact 17700 
cctcgggcag ggtctcgtcg ctcatggcgt 17760 
gcactccacg tgcttgcgca actgcgtgag 17820 
gagctccacc gcgtcgtcgg ccttgcgcag 17880 
atcgcggcgc tgcacgtccg cgtgggtccg 17940 
gtcctgcgcc gtgcggcgct gcgcgcgggc 18000 
acccgcatcc ggggccacca cggtgccgat 18060 
cgacaccagc gcctcaccga tcagcttccg 18120 
ggcgaccgcc gacaccgccg acaccgcgcc 18180 
cacgtccgag ccgtccagga agatctcagg 18240 
cgctgcgcgc accacgcctg ccactgcggc 18300 
caccgccagc gtcacggcgc ggtacatcgc 18360 
gaacgccaac ttgcgggaca ccgtcgactt 18420 
cacaccaagg agcctggcgt gtgccacgtg 18480 
ccgacacccg gccggaccaa ccgtccattc 18540 
catccagatc caatccggac tgcgggacaa 18600 
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atacgaaagt tctacgaata aggtcttggg 
gctcggcacg cagcaggact tcaccacccg 
ccgggacggc cgcttcgagc tgatcgtctc 
ctcggtcgcc ttgaccacct ccgagaccag 
gctggtcgcc cagctcaacg agcagcacag 
gtccgtcgcc ccgggctccc cctacgccgg 
ccgcacctcc gtgtcgatcg tcgcggtcgt 
gccggacttc cagttctccg ccggcgacct 
acgcgcagcc agcgaaatcc tggaaaaggg 
gccacctgac cgggacacgc gtcaccaacg 
ctccatacgc agggcacacg cgagaggaaa 
agaactgggt gcggttttct tcggtctggg 
ggtgtcaccg atcccgctct acctgatcgg 
ccccctgcac ggcatcgaac cgttcaccca 
gttgttgctg ttaggcctgg aatactccgc 
ctggctggcc ggtgtcatag acatcgtgct 
gctgctgggt tggggtccgc tcggcgcgtt 
ctccggcatc atcgcgaagg tgctcggaga 
ggtgatcctg tcgatcctgg tcttcgaaga 
gaccgccgtg ctggccgggg tcagcttcct 
ggtcgtgatc accctggtgc tggtggtcgc 
ggtcgacagc cccgacccgg aggtcttcct 
cgcggggatc gcctcggagc tgcaggtctc 
cgcgatctcc gggtcgaccg cggcgaacgc 
gttcgccgcg ctgttcttcg tcgtgttcgg 
ggtgctgccg gtggcgctgc tgctggcggt 
ctggttcgcc gcccggatgc agggtgtcgg 
gctggtggcg cgcggcgaat tctccatcgt 
ggtgccgggc gagctcgccg ccctcgccac 
cccggtcgcg gcccgcgtcg tcgagccggt 
gaaggcctga ccggaaaagc ctccggctgt 
cgctgcggat aacgcgggcg ggtctggcaa 
ggcccactgc gcggtagagc gagccgatct 
ggcgctcgtt ggtcagccgc acatcgccga 
agccgacgtg cttgagcagc cgccgcacga 
ccaccagcga ccggttctgg ttcatgtcga 
cgtcctccag ctccacgccc tcgcgcagct 
gcacctccgc caggtacgtc ttgcgcacct 
gctcgccgtc gttggtgatc agcagcaggc 
ggaacagctt gccctcccgt tcccgcaggt 
ccgacatggt gcacaggatg ccgcgcggct 
tgaccatcac ccggttgccg tccacgtgga 
ccagctcggt gacgacctcg ccgtcgacct 
ccgcgcgccg cgaggccacc ccggcctggg 
aatcggcagt gctgctgtac ggggaacggt 
ggcaacaagg gggcgagcgg cggcaactcc 
aacagctcgg tcgtgcagta caggatgccg 
ataaggccgc gccccaccag ggtacggatg 
gcgacccgcg aacgcgtcac cggctgccgg 
gcacgcgtca gcttcgaacg ctgcccgtcc 
ttctcccgcg tgtagaaccg ccagccgtcg 
gcctcggcgt agccgtccga gagccgccgc 
tccagggtgt cggccagcag ctcctcgccg 
tccagcgccg catccagcgc cgaatcggag 
gcctcggaat ccccagcggc cgccgacggc 
ggggtcggcg actcctccgc gacctcgccc 
actcctcttc ttcatggctg gcgcgctgcg 
tgccgcccac ccagctcacg atcagctcac 
ccttctcccg gtagagctcg agcagcgcca 
cgcagtccga caccagctcc gagaacgtcg 
gcagcgccgc gtgctcgcgc accgaaacgc 
tcggcggtgg cttcggccgg aacaccgcgg 
ccccgatcac cacttccggc agcaggttcg 
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tgtggatgta actgtgaccc ccttgccagg 18660 
atccggccac cgcatcggcg tgatcaccta 18720 
cgaccacgaa gacccagaca aggtggccgc 18780 
cacgctggcc aacctgctgg gcgccccgca 18840 
cgaggtcgcc gggatcacca cctggcagct 18900 
ccgaaccctc ggcgacacgg agatgcgcac 18960 
ccgggacggc accgtgcacc cctcaccgcg 19020 
ggtggtggtc gtgggcaccg cgaaggggct 19080 
ctgatccggc ccgtcccgaa cactttcccg 19140 
gcggtccagc gcgtgccgat cccctaacaa 19200 
cacgttgcaa gacacggcga tctccctaat 19260 
catcctcggg agactcgcat ggaaaatcgg 19320 
cgggctggcc ttcggcaccg gtggcctggt 19380 
cctcgcctcc gagatcggcg tggtcctgct 19440 
gggtgaactc gtcaccggct tacgccgctc 19500 
gaacgcggta cccggtgcgc tggtggccct 19560 
cacaatggcc ggcgtcacct acatctcttc 19620 
cctgggtcgg ctcggcaacc gggaaacgcc 19680 
cctggcgatg gccgtctacc tgccgatcct 19740 
cggcggactg accgcggtcg gggtgtcgct 19800 
gctgcgcttc ggcaagtacg tgtccgcgct 19860 
gctgcgcctg ctgggggcgg cgctgctcgt 19920 
ggcggcggtc ggctcgttcc tgctgggcat 19980 
gacccgcatg ctggagccgc tgcgcgacct 20040 
cctcaacacc gatccgagcc agatcccgcc 20100 
ggcgacctgc gcgacgaaag tgctcaccgg 20160 
ccggatgggt cggctgcggg ccggtgccgc 20220 
gatcgccggg ctggccgtcg cctccggcgc 20280 
cgcctacgtg ctgctgatgg cgatcgtcgg 20340 
ggcccggatg ttcctcccca gccaggcgaa 20400 
cccgagcgga cgggccggag gctttctcaa 20460 
acgcaacacc ggtgcgcgaa cagcctcaga 20520 
cctggcggtt caacggccgg atcgcgcccg 20580 
tccgcgtccg caccaggcgc tgcaccgggt 20640 
tgcggttgcg gccctcgtgc aacacgatct 20700 
ccagcttgaa ccggtccacc ttcaccagac 20760 
gctgcccgag gtccttcggg atcgacccca 20820 
tgtacgacgg gtgcatcagc cggttcgcca 20880 
cctcggtgtc ctggtccagc cgccccacgt 20940 
aatcaccgat gcacggacgc cccttgtcgt 21000 
tgttcagcag caggtgcgtc agatcgtcgt 21060 
ccaccgcggt cgtcgggtcg acccggcgcc 21120 
cgatccgccc ctcgacgatc atctcctcgg 21180 
acaacacttt ctgcagccgc acgccttccg 21240 
gctcagacgt catcaatcga atccacttcg 21300 
ttcaacgacg acagccccag ccgctccagg 21360 
cccgtctccg ggtcggttcc ggcctcttcg 21420 
acgccgtcga cgttcacacc ccgtactgcc 21480 
tacgcgatca ccgcgagggt ctccagcgcg 21540 
aacaggtacc gctccacgta cggggcgtac 21600 
ccgacccgcc gcaggtcaac gccccgttcc 21660 
agcgcggacc ggatccgggc caccggctgc 21720 
gccggcacgt cgaccacgag cagcaacgcc 21780 
gtcaggtccg gcgcactcgg cgcctccacc 21840 
tccaacggtt cctgcggctc agcgggcggc 21900 
gcggcctccg gctccagctc cgtcacccgt 21960 
cctcggcttc ggcctccgcc tgcgccacgc 22020 
ccagcggggt ctcctgctcg aacagcagga 22080 
ggaaccgcgc caccacctcg acggtgtgcc 22140 
cgctgccctt ctccgccaac cgcacccgca 22200 
cgtgctggtg gatgtggtcc agcgaaacgg 22260 
ccgcgatctc ggcgaaccgc tgcggcggaa 22320 
cgaaccgctc ctcgaccgac accgaacgcg 22380 
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gatagcgccg cagcgccccg gcctccaact 
cgcggtactg cagcagccgc gcgaagagca 
cctcgtcctc gacctccgcg gccggcagca 
cggcgaccac caggaactcg gtggtttcgt 
tgtaggcgat gaactcgtcg gtgaccttgt 
gctgcgagat cagctgcagc agcaggtcga 
tgaaccggcc gcctgcggcg ggatcctcgc 
tcactctccg ggctcctgca ggcgccgcac 
gtcggcgagc accacggcaa ccgcttcgcg 
gtggtcggcc cgcaatgcca gccgcgcctg 
gtacacggtg atcttcgaat cgtgcttgcg 
tgccgcctgc tccgggtccg gcggattcgc 
caattcggcc gctccgggca gggcgacccg 
agtgcgcggt acgcctgcgc cccggccgag 
gccaccgtgg tctccggaaa ccgcaccgtg 
ccgaacgcct cgacgacgcg cgccatcacc 
atggtcgcca ggatgccgct gatctccagc 
atggtgtcga tcagcagcgc caccccgcgc 
atcacggcgt ccgcggcggc caacgcgttc 
tccaccagca cgtagtcgta ctccgccagc 
tcccggccca cctccgcgac cagctgcacc 
agcaggtcca tgccctcgac ggtggtctgc 
tccatgatca cgttgtagat ggtctggtcg 
gagagcgcgc cctgcggatc gaaatccacc 
gctgcgccga ggttgatcgt cgacgtggtc 
gcgagcaccg atgcgggacc gtgccggtcc 
ggccggccag tgggaccgag gccgcggggg 
cccgccgtct cccgggcgat gctcaggtcg 
ggctgcggca gcgacatggc gatctgactc 
ctaagtggca ttcgttacca gcggcaacgc 
atctgcgtgt gatcaacgtc taccgcgtac 
gtcactccgt agcgtgtgcg cgcggatgcg 
ccgtgatcaa ggtgtagacc tgggtcgtcg 
ccacgcggac atccgcgccg ccctccacaa 
gcggcgaaac cgagccgttg atcccggcgc 
agctctgccg ggaaagccgg gtaccgcgcg 
cgcgcgcggc caggccgggc ctggcccgta 
gtcggccgat gggcaccagc cgctggcggc 
tggtgccgtc gatgtcgtcg aggtccaacc 
aatacagcac ttccagcagc gcccggtccc 
gatcgagcag cttcccgacg tcgccgatgg 
tcggcggcgc gacatcgcgc gctacgtcga 
ggtgcaggcc gcgggcggcc accaaagctc 
gctgcccggt gccttcccgg agctccgcca 
cgccgagccc agaaatccct gccgcgacca 
cgtaggagtc gagagtgctg cgggcggtac 
cggtgatcgc cccgcgcagg tccggcggca 
gcgctggggc cggtccggcc attcggcatc 
gaccgcgtgg gcggcgagca gccccgccac 
cgccatccgc accgcctcgg cgagcgggaa 



<210> 4 
<211> 45624 
<212> DNA 

<213> Saccharopolyspora spinosa 
<400> 4 

tcagccgacc ggcttccgcg ccgtcagaat 
ataatcgcag accgcggcgg cccagccggc 
gtgctggtgc gcagcgagct cttcggcgaa 
cacgtcctcc caatcgagga tctcgaaccc 
agccaggcag atccgaaggc cggtcggcca 
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cgccgaacag cgcggccacc tgcttgtacg 22440 

ggtcgcgggc ctccagcagc gccaggtccg 22500 

gccgggccgc cttcaggtct aacagcgtcg 22560 

tcaggtccca ccgctcgccg agcgccttgg 22620 

gcagcgctac ttcggtaacg tccagctggt 22680 

aggggccctc gaagttgtcc agccgcaccg 22740 

tcgactgcgc cgccggtacc gcggggtctg 22800 

cagcaccgag tccgcgccgt tcttgtcgag 22860 

caccacgcgg cctcggtcca cggcgatgcc 22920 

ctccaagccc agcaactcct cgtcggacac 22980 

ccgtccagag ccgccccggc ccgccgcggg 23040 

gtgatccgtc cgaggaacag tagtacggaa 23100 

acgcgtcatc gggcgatcac ctcgcgcgcc 23160 

cggggcgccc agcgggtgat cggctcgccg 23220 

cggttgatca ccgcatcgaa gacgatgtcg 23280 

tcgcgcgagt gcagcgtgcg cgggtcgaac 23340 

ctcgggttca gccgctcccg caccttctcg 23400 

aggctgaaga actcgcactc cagcgggatt 23460 

accgtcagca gccccagcga cggctggcag 23520 

gccggctgca gcacccggtg cagcgtctgc 23580 

tcggccgcgg acaggtcgat gttgctgggc 23640 

cgcaccacgt catggatggc gaccggccgt 23700 

agctggtgcg gctggacccc gaggccgacc 23760 

agcagcaccc ggcgcccgta ttccgccagc 23820 

ttcccgacgc cgcccttctg gttgcacatc 23880 

agcagcggcg gctcggggat gtgccgacgt 23940 

ttcggcgcca ggccgacctc ctcctcgcgg 24000 

accgcgcccc gggaccgccc ttcggcggcg 24060 

cttttggtgc cggtggcgat caaacgcagc 24120 

gcctcgccgt ggtgtcggaa gttccctgtg 24180 

accaccccct cgcattcgca gcgtcaccaa 24240 

tcgtcgcgta gacctcccgc agcgtgttca 24300 

tgaccgaggc gtggcccagc agttcctgca 24360 

ggtgcgtggc gaacgaatgc cgcagcacgt 24420 

gctgcgcggc ggtcttcaac gcgttccacg 24480 

aattcaggaa caccgcggca ctaccccgcc 24540 

ccaggtacgc gtccagcgcc gccagcgccg 24600 

cgcccttccc ttccagcagc acggtccggt 24660 

cgaccgcttc ggagatccgc gctcccgtcg 24720 

gcaggccccg ggcatcctcg cccccggcgt 24780 

aaagcgcctt cggcagccgc ttcggcgggc 24840 

ccggcagcag cccctcggcg tgcgcgaagc 24900 

gggccgacga cgacggcgcc agcgcgggcc 24960 

agaacccgga caggtgctcc gaacgcacct 25020 

ggaattcggc ataccggcgc aggtccctgg 25080 

cgcgctcgac ggcaaggtgg tcgaggtacc 25140 

gctcctcgaa acccctcacc gggcggcgaa 25200 

cgccgggcga ggctccaccc ggccctcgcg 25260 

cccggggccg ttgacgatct ccccggccag 25320 

gcggcggatc 25360 



cgcatagccc atgtcgtggg catatttctc 60 
gacagccggc ccgtacctgt ccgcgatccc 120 
ctgcggcatg aagtaccggg tcctcgacga 180 
cgctgcacgc agcgattcca gaagttgctc 240 
cctgtccccg gacaccggca tcccgccgcc 300 
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cgcttctcgt ttgacgacct cggtgacgcc 
gatttcccgg atggcacggt ccggttcgga 
ggcgtcgaaa gcattgtccg ggtacggcag 
ccggtggctt agtccgcgtt cgcgtgcgca 
gacggtgatg ccggtgatct ggatcgcgtt 
ggtaccgcac cccacatcga gcagtcgaac 
aaggtcggtg agccggtcgg cggcctgctg 
gccgtggtgg atggcgcagg ggccgcccgc 
atacatctgc ccaacctgct gcgatgttgg 
tctctgcaac gttcaccttc tcgtaatttt 
aaagacggca aactagccgt cagttaattc 
tataactatc cggccaggag gcaaacagcc 
aaccaacggt gatcaccgta caggtccggc 
cactcgttcg cgtgaaacca ggtgcggcga 
cctgccgtga cacactattg ttgccgccgg 
tacgcagcgc attttccaaa tcctggccaa 
gaaagagact ggggtccgaa gtatgcgcgt 
tctcatggca atggtgccgc tgtgctgggc 
cgccgcacca ccagagctgc aggcgaccgc 
ccgcgggaac gacaggaccg gcgatacggg 
ggcgttcggt cagcgcgaca ccgaggcagg 
tgtcgcgcaa agctcgctcg atcagctccc 
accgtcagtg ctgttggtcg acgtctgcgc 
cgacctgccg gtcgtgctgc accgctgggg 
tcgagcccac gagttgcttg acccggtgtg 
tcccgagctc atcctcgatc cctgtccgcc 
cgcaccggtc cagtacgtgc cgtacaacgg 
gcgcacctca gcacggcggg tctgcatctg 
gccggctccg ctgctgcgcg cagtagcggc 
gatcgccgtt ccccccgagc accgggcact 
cgccgaatcg gtcccgctca acctgttcct 
cggctcggga acggcattca ccgcgacccg 
gtacttcgac cagttcgact acgcgcgcaa 
gccggatgag caggcccagt ccgaccacga 
cggcgacacc ggcttcgctg ctgcggcaat 
ccatcccgcc gcgctggtgc ggacgctgga 
tagtcacccg agaacagctg gatccggaga 
ggacctgctc aagaaggtac tcaccaacac 
ctggcaggac aacaccgact acaggcaggc 
tgtcgcgcac acgatgatcg gtctggagcg 
cgtgctcgca gacggtgttc ccggggattt 
atgcatcttc atgcgcgcgg ttctccaggc 
agtggattcc ttccagggaa tgccggaaag 
tatggcgctg cacgagtaca acgacgtgct 
cttcgcccgc tacgggctgc tcgacgaaca 
caccttgccc accgccccca tccaggaact 
cgaatccaca atggactcat tgcggaacct 
catcatcgac gactattttc tgccgtcctg 
actcgggatc acggaaccca tccacgacat 
ctggtgaacg gctcagccgt cctcgacgcc 
ggcaggctca gcgttgagcc ctttctccac 
cgccacgttc cgttccgtgt aggagggctc 
atgcaggccg tcgatcagtt ctttgagcat 
cccatcgcga ccgccgtagc cgggccagta 
accacctggg cggacgtgcg gaaacaggga 
atggctgccg tcgtcgatga caatgtcgaa 
gaattccgca tcgctctggt cacctcgcag 
tttctcgaaa atgtccagcc cgtaaacgag 
cagcgaagca ccaccgagtt cgggtgcgtg 
caccggggca tcctggaatc gggagaagtg 
gtccgcccat ttgtcggatc cgtagcggac 
ggcggctgcg gccagcacgg cgtgcaccgc 
gcgctgcgcg ccggcgaggc cctggtcggg 
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gaggatgcca ccgggtttga gtactcgaag 360 
catctccaac agcgactgca tggcccaggc 420 
ggacatggca tcgacgcacg agaagtccac 480 
atcagcggcg atggccactt gcacctggct 540 
gtcgcgcgcg acgcgcagcg ctggttgtcc 600 
gccgccatcg agcacggtcc gttcggcgac 660 
ccaggaagcc cgcccgtcgt tctcccagta 720 
gaccgagttc agcaacggcg tgaccaggtc 780 
tgcgccacct ggcaacaccg gtatgcctga 840 
ctggaaccga gcaggaaaca cacagagaga 900 
gcggttaccg ccgcatgcgg cggtaccgac 960 
agcgcaaatc ttccccatga atccccgcag 1020 
ccaccccgat cgagcggagc tcgcgcattt 1080 
cccaaagatc ttcgatggtc gcgtgacatt 1140 
aagtccgttg gacgtcaacg gcgatcgaca 1200 
tgggagtagg ggctgctgcg ctaatcgggg 1260 
actcgtcgtt cccttgccct atccgacgca 1320 
gctgcaagca tccgggcacg aggtcctgat 1380 
gcatggtgca ggtctcacca cggccgggat 1440 
tggaaccacg cagctgcgct ttcccaatcc 1500 
ccggcaactg tgggagcaga ccgcgtccaa 1560 
cgaatacctt cgactggccg aggcctggcg 1620 
gctgatcggc cgggtgctcg gcggattgct 1680 
agtcgacccc accgcaggcc ccttcagcga 1740 
ccggcaccac ggactgaccg gcctgcccac 1800 
gagcctgcaa gcaagcgacg cgccgcaagg 1860 
aagcggcgca ttcccggcat ggggcgcggc 1920 
catgggccgc atggtgctga acgccaccgg 1980 
tgccaccgag ttgcccggcg tcgaggccgt 2040 
tctcaccgac ctacccgaca acgcccggat 2100 
gcgtacctgc gagctggtca tctgcgcggg 2160 
actcggcatc ccgcaactcg tgcttcccca 2220 
cctcgccgct gccggggcgg gcatctgctt 2280 
acagttcacc gactcaatcg caacggtgct 2340 
caaactcagc gacgagatca cggccatgcc 2400 
gaacactgcg gccatccgtg cctgatgaac 2460 
accgatgccc tcccagaacg cgctgtacct 2520 
gatttacagt gatcggccgc atccgaacgc 2580 
cgctcgggcc aaaggcacgg actggccaac 2640 
gctggacaac ctccagcact gcgtggaagc 2700 
cgccgagacc ggtgtctggc ggggcggcgc 2760 
attcggagat accggacgta ccgtctgggt 2820 
ctctgcgcaa gaccaccaag cggaccaggc 2880 
tggcgtatcg cttgagaccg tccggcagaa 2940 
ggtcaggttc ctccccggct ggttccggga 3000 
cgccgtgcta cgactcgacg gcgacctcta 3060 
gtacccgaag ctctcgccgg gcggattcgt 3120 
ccaggacgcg gtgaaggggt tccgcgcgga 3180 
cgacggcacg ggcgcctact ggcgccgcag 3240 
gagcgcttgc cggggcacga accccggcgc 3300 
gaataccagg ttgtggtaga agtgcagggc 3360 
ggtcccgcac cgcgattcgc gctcctgata 3420 
gtcgatcgag gtgcgctggg ccgcgggttc 3480 
cgccgtctgg agatcctcga tgacgtacaa 3540 
ttggaaggat ttcttgacgt ggtcgttgac 3600 
cggcccgatc ttccccgcca tgtcttccag 3660 
ctttcgcact cggtgccctt cgttcccggc 3720 
acctcgccgg aagtaccgct gccacatgcg 3780 
ataaccaccg attcctattt ccaacactcg 3840 
gtgctcgtag tgttcggtgt accagtgcag 3900 
tgccagcgca ccgaggtccg cgcgcctggt 3960 
ggtggacacc ctgttccgga acgccagcag 4020 
cgagaactgc gtcatgctgt ccgaccagcg 4080 
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gacttcacgg ctgttgtgcc gcctgccgtc 
caccgcgtcg aaccggagaa cagcagctgc 
gtgatcgagc tcaacggttc tccggaccag 
tcgcaccgac tcgtccaacg acagcggatc 
gtcggccacc gcgtccagtc cggccacctg 
acccgccgcg agcagcaaac gctccaccac 
ggaaagtcat caccttttcg gttccagcgc 
ccacaaagta gagccggcgg aggcgaccgg 
atcacccgga cgcgtagatg atctcggctg 
gaggggcggt gcccgaattc catgacccgg 
agatcaccgt cgagcccgac gatcctcgtt 
gtttcaccgg aaaacccgaa cgcatccaca 
ccgtcgccga cgccgtgcgc accggcaggc 
tcgagaatct cgttgcggac ccggcgatcc 
gcgtgtacta cgacagcacg cgcggggcat 
aggtgtaccg aaccctgttc aagaactggg 
gggtgggcgc aggcgggcac atcctcggcg 
gttcggtcgt cgactacctt caaggcgtcg 
tgcacatcgt cgaggccgac cggaactcca 
acaccggtgg cggtggcggc aacttcggga 
acgtggtcag caccgacgcc gcagagctcc 
gatcgttcca ctggccgtgg cacgaactga 
acttcggcaa ttggtacgag cagcacagcg 
gcacgctcgt ctgcgcacac cggcaagctg 
gcacggatcc gaacgcggaa cgcaccctgg 
tcggcgtgac tccagccgaa gggctgcggg 
tggccggggc gatcgccgaa ggcggcgaac 
cctacttgcg caccgggctg tccgaagctc 
tctacggata cgacaaccct gcggcggcgc 
atgccgtggc tccgtcggcc accgcactcg 
tcgtcacgaa ctggtcggag cccgccgagg 
tctaccgcga gatgtacgcc gaaaccggcg 
gctcctacat caactacccg gacaccgact 
ttgcctggca cgacctgtac tacaaagaca 
ggtgggatcc gcagaacatc ttccagcacg 
cacccggtca gccatgagga gctcatcacg 
gaacgcatca tcctcgccgc cggatccagt 
ctcggactcg ccaggatcgc acccgtgctg 
gcccccgaca tcgaacggac cgaggtcgcg 
gacttcgtcc tgacgctaca gtccggtgag 
gacgtcccgc tgcggatcgg ttacgagctc 
ggagctccca gggccgtcgg cgcccggagc 
tcgatacccg gtccgtcgga actgtccgat 
ggctgcgggc accgacgtcc cgacctcaac 
tggggcggcc tgcactggtt caccccgcta 
cgcccggtgc gcatcctgga gatcggtgtc 
gaatccctga agatgtggaa gcgctacttc 
ttcgacaagt ccttcctcga ccagcagagg 
cccgaggagc tggccgccgt tgacgacaag 
ggcagccaca tcaacggaca cgtgcgcaca 
agcggtggcg tatacgtgat cgaggatctg 
caggcgcagt gcccggccgc acccggcacc 
ggcgttcagc acgaggagca gccgcatgcg 
aatttggtcg gcctccacac ctaccacaac 
gaaggcggcg ttcctgcttg ggtgccaagg 
gtgaacagcg cggaggacga gtgaacagca 
cggatcaggt ggagtccatc ttcgatgcgt 
actgggcggg cgggtatcgg gaggatgccg 
aactgaccga cctgttcatc gacaaggccg 
tgggctgcgg caatgggcag cccgtagtcc 
ccggaatcac cgtgaacgcc cagcatctcg 
gactggccgg cagtcttgag ttcgatctag 
gtttctttca ggccgcatgg gcgatgcagt 
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aaccgggccg aagagccctt ccagcagatc 4140 
ttcggcgacc gccgcgagcc gcaggcctgc 4200 
ctgagcgcca gaagtgatct ccaggccgat 4260 
gcagcgtacg acgagttcgt cgacgatggc 4320 
cactgcttcc tggagcttcg ccgggcccgc 4380 
cgaccagggg gcaactgcga tctcactcat 4440 
atacggcacc atgcgcaccg cgatatcgat 4500 
gtttcacgta tgagctgaaa cacacagata 4560 
cgggcgaaca aagtggacca gtcagaaaag 4620 
cacccatgaa tcgtcgaacc ccaggaacag 4680 
atccggacct cgtcgtcggg cacaaccccc 4740 
tcgccagctc cgccgaagac gtcgtgcacg 4800 
gggtaggggt ccgcagcggc gggcactgct 4860 
gagtgctcgt cgacctctcc gagctcaacc 4920 
tcgcgatcga ggcgggcgcc gccctcgggc 4980 
gcgtgacgat cccgaccggc gcatgtcccg 5040 
ggggatacgg cccgctgtcg cgccgattcg 5100 
aggtcgtcgt ggtcgaccag gccggtgaag 5160 
cgggcgccgg tcacgacttg tggtgggcgc 5220 
tcgtcaccag gttttggctc cgaacgccgg 5280 
tgccacggcc gcccgcgaca gtgctgctcc 5340 
cagagcagtc attcgccgtg ctcctacaga 5400 
cgcctgaatc cacgcaactc gggttgttca 5460 
gctacgtcac gctgaacgtt cacctggacg 5520 
ccgaacacct gtcggcgatc aacgcccagg 5580 
aaaccctgcc gtggttgcga tcgacccagg 5640 
cgggcatgca acggaccaag gtcaaagccg 5700 
aactagccac ggtttaccgg cggctgaccg 5760 
tgttgctgct cggttacggc ggtatggcga 5820 
ctcagcgcga ctcggttctc aaagcgctgt 5880 
acgagcggca tctgacctgg attcgcggtt 5940 
gagttccggt gccaggtacc cgtgtcgacg 6000 
tggccgatcc attgtggaac acctccggtg 6060 
actacccgcg gctgcagcgg gccaaagcgc 6120 
gcctgtcgat caaaccgccg gcacggcttt 6180 
atgtccacaa cgcacgagat cgaaaccgtg 6240 
gcggcgagcc tggccgacct gaccaccgaa 6300 
atcgacgaga tcctcttccg cgcggaaccg 6360 
gtccagatca cccaccgagg cgagaccgtt 6420 
ctgatcaagg ccgagcaacg accggtcgga 6480 
accgatctca tcgccgagtt gttcggccca 6540 
accaacttcc tccgaaccac cacatccggt 6600 
ggcttccagg ccatctccgc agtggtcgcc 6660 
ttgctcgcct cccactaccg cacggacaag 6720 
tacgagcgac acctcggcga gttccgtgat 6780 
ggtggctaca acttcgacgg tggcggcggc 6840 
caccgcggcc tcgtgttcgg gatggacgtt 6900 
ctctgcaccg tccgcgccga ccagagcaag 6960 
tacggaccgt tcgacatcat catcgacgat 7020 
tccctggaaa cgctgttccc ccggttgcgc 7080 
tggacgacct atgctcccgg attcggcggg 7140 
acggtcagcc tgctcaagaa cctgttggaa 7200 
ggctcgtacg agccgagcta cctggaacgc 7260 
atcgcgttcc tggagaaagg cgtcaacgcc 7320 
agtctggacg acatattgca cctggccgac 7380 
gaggggcgaa cacacaggca tttccgaccg 7440 
tggcgcacgg gcgtcccctg caccacggtt 7500 
gtgccacacc gtggtcggat gctgccgacc 7560 
cgctccgtcc cggagcgcac ctgttcgacc 7620 
gtgcggcatg cgccagcggc gttcgagtca 7680 
ccgccgccac caggctcgcc aacgagaccg 7740 
tcgacggcgc ccagctgccc tacccggacg 7800 
ccgtcgtgca gatcgtggac caggccgccg 7860 
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cgatccgcga ggtccaccga atcctggaac 
tcactcgggt tcgactcccg gaagagtacg 
ccttgaacag cttcacggcg ctggtcagcg 
acctcacggc acagaccagg tgcatggtct 
tcgatgagct cgccggcgtc gagcctgcgg 
gagacatcgc ggcgaagcac ggaccgggac 
accggaaaca tccggattac gccagaaacg 
ctcgaaagaa gcagtcctga tggcctccga 
ggcacccgcg gatgatccct tctaccgacc 
cacgctcctc agggcccggc ccgtctcggc 
caaggcctgg caaatcctct accggtccaa 
cggcaccgtt ctggtgccga acatcccgtg 
cgcagtgggc acccacggcc tcggtagcca 
aaccgagccg gagaccgagc tgatcgccgt 
caccgactac gaaggcctcg gtactcctgg 
gggacacgcc atgctcgatg ccgcccgcgc 
gaccgactgc ccggtcggca tctggggcta 
cggcgaactg caccccacct acgcacctga 
ggtgccgatc gatctgctgg acatcatcca 
gctggccggc ctggtcgggc atgccgctgc 
caccgaagcg ggtcgtaccg ccgttgatca 
cacccgcttc ctcggccgcg agctgagcga 
acctcgatgg cgcgcacgac tggccgaaag 
ccccacgctc gtctaccaca gcacggacga 
actccgggac agctaccgcg cggcgggtac 
ggctcacttt cccgccgccc tggccagctc 
cttctccgag ccgtccgcga tcagcggtcc 
gcctctccag cggcttcccc gctggcttcc 
ctacccgcgg atgccgggcg gctatgtgga 
caccagcgcg gcctgctgga tcatcgattc 
cacgcctgtc cctgctctga tgtcgcgtgc 
ttccgctggc aaggacaact cgtcgacaac 
ccactcggcg ggcggcgtga acgcccggtc 
ctcgtacgcg ctgcggtagt ggtgcacgaa 
ttccgattgg agcagcacgc tgcccgacaa 
gcttgcgccg agaaccgtga gcggaccgag 
gacaccgatg tccagcaacg ctccgccacc 
gggaagcggc ggaattccga acacggcggt 
ttgcagcagg tcgcggacca cgtcgtgccg 
cagcagcagg ttcttcctcc tggccagccc 
cgtcagcggt ttctccgcaa gcacgtgttt 
tgcatgcatg ccaggcggca acggcacgta 
ccgctggtaa cccagcaccg cctcgcattc 
cggatcacgg ctcgccaccg ccaccacctc 
catccgtcgc cacgcgaagg aagcgcaccc 
cgagctggtc atcgccccaa cgcccacaag 
tggatgttca agaagtgggt gctccggagc 
cggaagtcgc tgggaggtcg tgccgcgaag 
ttctgcgcct ggtagaaccg accgccttct 
tcgggatcgg cggtgagcac gtcgtccaag 
tggtagttgg ccacactgca ctggaccgtg 
gcctccaccc gagcgtggac caaaccgtgc 
gcgatctcac cttgttctcg cggttcgatc 
ttggtcgcgg acaccgacac cgcgatcacc 
tcggtatcgg tgcgatacca cttgtcgacc 
cggtagcggg ccttggcttc ctcgaaccaa 
atgccgtgcg agagcgacct ggccaccgcc 
tcgggtccgg tggcggaatc gtggaacggg 
ttgacgatgt tgtcctgacg aaggagagcc 
tctggcagga ctggcacttc ctcctggact 
tagaaccagg acccctgttc agactggagc 
ccgaggaagt agtccacata gggcggaacg 
cgagttgcct gaaccgtcgg cgagagctgc 
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ccggcggccg gttcgtcctc ggagacatca 7920 
cggcggtttg gacgggcacg accgcccata 7980 
aagccgggtt cgagattctc gaagtcaccg 8040 
cctggtacgt cgacgagttg ctccggaaac 8100 
ctgtcggcac ctaccagcaa cgctacttgg 8160 
cagcacagct gatcgccgcg gttgcggaat 8220 
aggaaagcat gggtttcatg ctcctgcagg 8280 
gcacgccagc ctggtcggcg acgatctgcg 8340 
gccgacgccg ctaccgccgg gtgtcccggg 8400 
actgcgcggc acgggcgaac ccgtcgcagc 8460 
ctccgccctt ggcatgccga acgccgtctc 8520 
gccgcgcgaa gatcgcccca tcatcacttt 8580 
agttgccccg tcgtacctgc ttcgaaccgg 8640 
ggccctcgac cgcgggtggg ccgtggtcat 8700 
aacccacacc tacaccgtcg gcagggcgca 8760 
tgcgcaacgg ctaccgggct ccggcctgac 8820 
tgcgcagggt gggcaagcgt cggccttcgc 8880 
actgcgaatc cgcgctgcgg ccgcaggtgc 8940 
ccgaaatgac ggggtgttca ccgggccggt 9000 
ctaccccgat ctgccattcg acgagcttct 9060 
agtgcgcgag ctcggtgcac cggagctcgt 9120 
cttcctcgac acttccggcc ttttcgagca 9180 
cgtcgcaggt aggaacggtg gcccggtggt 9240 
cgagatcgtt ccgttcgcat tcggcgagcg 9300 
gccagtgcgg tggcatccgc tctccggatt 9360 
gcgagtggtc gtctcgtggt tcgacgagca 9420 
gcgagatgcc aggtgagcgg atggcggtga 9480 
ggatgcccgg tcaggcttcg aggcgaacta 9540 
cccgcaccga cgggcctgac cgatcgcctc 9600 
gcccgaatct ccggccaccg cagggtcgtc 9660 
gaaggcggtg accgccttgc gaacctgatc 9720 
gcccttccgc tcgattcgga tcacggcctg 9780 
gacgacgatt cgcccacgac tcccccacag 9840 
accgtatccg aggtgggcaa cggtgccacc 9900 
gtcgacgccc gactcctggg cctcgtgcga 9960 
gagaaagtgc cgagcggcac gggcgggata 10020 
gagttcggtg cgatagcgga tgtccgtgtc 10080 
gaactcccgg agctcaccga tctccccgga 10140 
gccgtggtgg aggaacaggt aattctcccg 10200 
gaccaggcga gcggtgtcgg acgccgtcgt 10260 
gtctgcctca agcgccttgc cgatccactc 10320 
gacggcatcg atgtccggcc gctccaggag 10380 
gaatcgcgct gcgaaccgtt cggctttcgc 10440 
tgtttcggcc acgtcgcaca tcgcgggcag 10500 
gagcacaccg atgcgcaccg gctttcgcat 10560 
ctatgcaggg aggcgaccaa gctgcgcgcc 10620 
agctcgccca actgtcccaa agtcatccac 10680 
tcctcatgca cctcgatgat ccggtacctg 10740 
tcggacagga tcgattcgta ccgcacggtt 10800 
aacggcggcc agtcgttcga cggggtgctt 10860 
ggagcgattt ccgcagtcca cttataccca 10920 
agcactccgc cgatccgctt gaccaacagt 10980 
atcggctgcg tccagctggc gacctcacga 11040 
gcgaagtact tgccgtcctg gtgggcgatc 11100 
ctgctcagcg gaacgcgcgt tgcccgcaag 11160 
ccgaccgcct cggagatact cgacgaatcg 11220 
tgtcggaagg gctcctccga ggcggctagt 11280 
atgcaagaca gcaccgtccg ggtgtccatg 11340 
agcacctggc cgagcgtcaa ccagcagaag 11400 
tccaccacca tgttccggtt gcgcttccgg 11460 
acgtctacca gcacgcggcc gcggccccgc 11520 
ctgccacggt gtgcctgcgt gtagttgctc 11580 
aggacgttga cgttgcccgg ttccatcttt 11640 
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gctgacatga ggcagtgcag cacgccgtcg 
acttcagcct ggttgatgat cggctggtgc 
acctgcaggc cttctaccgt gaagaaccta 
gcatcgaatt tccatttcga caggcggtcg 
tcgcccagcc gatcggcgag ccagcagtgg 
gcgctcagcg cagagtcgac gaaccgccgc 
cttgcctcag cttcgacaga actgctcatg 
cgccggctcc caggggcgtc gcggtctggt 
tccgcccgta aggaccgcga gcacctcgtc 
ggagaagaca cggacgtcgg ccacggcctc 
caaggtgacc ttcggctcgg catctaccac 
tcctggcgca cagcggtatg tctcgaccgc 
tcctcgtcgg aaatcggttg gagcgcaaac 
ttctggacaa gaatctgact tcgggactcg 
gagtcgacgc aacagcgatg aacacctgcc 
gttgctcacg gatggccatc agactgccca 
ttttccagac gtggaacgat gtcgtgcggg 
agcgccgcgt tggcagcgaa cgacggctcg 
tcgtcttcgg taaacgtcgc ggggtcgagg 
tccgccctac gagattcgtc ccagaaggtc 
accgtggcgg tttcctgcgt cgtcgtcgag 
agcagttcgt tgagcggtac gtattcgtgg 
tcccgtactt cgtcgtcaga caaggtggcg 
cgcaacaacc tttccaccat tgcttgttcc 
accctgctca gccgccgctt ggtcagcccg 
cgttcgcgca accactccgg caacaccgcc 
atgtagtcca agtccacagg caggcgcatc 
cccgtcacga tctcttcatc gaacgtggct 
gtcccgagcg ggtcgacgcg ctgctcaggc 
gaccgcagcc accccgacac atcgagggcg 
gctcgcgcca caactggccc cgaacacacc 
catttctcgg cgaaccccat gagatcgtcc 
tccacggcgt ccgtgtctct gccggagttg 
tgccgcccct tctcgtcgaa cgagaccccg 
aaggagatga ggtcgagctc gtcgccgacg 
atcgacacca tattcgggca gatggcgaca 
cacgccaacg gcaccaggtt gaagaagtgc 
cgcatgaact ctccgatcgc aattgaacac 
tctgatatcc cccgggaaac gccgctttcg 
gtagcgatct tgacaagcgc gagcaggtcg 
gtgtgccggg tagatcgagg atttctgagt 
tgatgtcccg caccgcatcg aaccagggca 
gtcccgctgg gacgtcataa cgcgattcgc 
gcgcgaagac tgatcgtcgc ctgcatgcag 
cgcgccagag tgacttcgtg tgacgacact 
gccaaggacc ggatattggc atccgtacgc 
ttcgtggctg gatcgacgcc gatccggcca 
gtggcactgg tggaagccgc gctggagctc 
ttcgagagcg agttcgcccg cttcttcggc 
tcgtcggcca atctcctggc actgagttcg 
ctgcggcccg gcgacgaagt gatcactgcg 
gcggtccaaa acggactcgt cccggtattc 
acgccagacc gcatcaaggc cgccgtcacg 
accctgggca accccttcgc cgctgacgaa 
ttcctcgtcg aagacaactg tgatgcggtg 
accttcggcg acctgacaac ggtcagcttc 
ggtggctgcg tgttgaccgg cagcctggaa 
tggggacggg attgctggtg cgagcccggc 
taccacctcg gtacccttcc accgggctac 
tacaacctca agaccaccga cctgcaggcc 
tccgcattcg ggtcggcacg ccgccgtaac 
ttgccgggcc tgctgctgcc ggtagccaca 
gcgatcacca tcagtgcgga cgccgggttc 
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atctccttga cgagaatgcc gaggatacct 11700 
cagctgggtg ccgcgccata gttggtctcg 11760 
ccgtcggcat gaaccaggtt ctccgtgctg 11820 
aacgggatgc gattggtctc gaagctgttc 11880 
aactccgtgg tcggaatcat gccattgcaa 11940 
gtgttgttgc tgccgagcgg cgcagcagca 12000 
cccaatccct cactgccgtg acgatgtgca 12060 
ccagcggcct ggcgaggatc atcgaagcga 12120 
ctggcgttcg gtcatgaaga aatgcccgcc 12180 
ggtgtgctct cgccaggctt ccgcctcatc 12240 
cagcacggtg atcgggcaac ccaccttcgc 12300 
acggtagtcg ttgctcagcg ggcgccacga 12360 
ctctggggag atcaacaccc cattggacga 12420 
cctggcatcg tcccggtcac ggcggaatcc 12480 
aaaccccggc gtccatgttt ccggccacac 12540 
gtgcggcggt tctcacgttc ggcaactagc 12600 
ctgggacttt cctcgatttc acggcggatc 12660 
tcgagcaggc gggccagctg acctcgcacg 12720 
accagaccgg ctccccgatc ggctaggagc 12780 
ccagggagaa tcaactgcgg tacgccgttg 12840 
ccatgatgga tgatcactga acacgactcc 12900 
acccggacgt tcgagggcaa ctcccccatc 12960 
atcacctcga cgtcgagcct ggccgcgccg 13020 
tggtcacttt gctccccgta ctgttcggtc 13080 
cgcgtgatgc agacgcgcgg cttcgtcggt 13140 
ggaccgttgt acggcacgaa acgcatcgag 13200 
caggatggaa tcggatctat ggtcgcttgg 13260 
ccgaacttgg cgagcttcgc tcccagccac 13320 
ggcttcgatt cctggtattc gaggaaaccg 13380 
acgagcatcc gcacgtgtcg tgcgccgagc 13440 
atagcgtccc acacaacgag atcaggctgc 13500 
agcgatcggt catccacaag gtggagttgc 13560 
attgacagca gctcgtcgaa gagttccgga 13620 
ctgccgagaa cgagttcgtt cttggccgcc 13680 
ggaaccgcgg tgagtcctgc tccggtgacc 13740 
cggacctcgt gccccgcggc acgcaacgcc 13800 
gaactcgccg gcagcggggt gaacaggaca 13860 
ccgggaagca tgccaagaat cacagaaatc 13920 
caagccaaat cttaggcctt ccaggtgatg 13980 
ttcccgctag cctgggctct accgagtcgg 14040 
caatgagcgc ttctccttgc tccgctgtcc 14100 
ggaaggtgta aggcgccgag acagcacact 14160 
cacgggcatc gctcatctcc tgaaggcaag 14220 
agccggaaaa ccagagcgct ggggaaaggg 14280 
tgcgctaccg ctactgagat ggtgccggat 14340 
gattaccacc gcgaacagga atccccgacc 14400 
tcgggcgccg tgctcgacga ggacgaccgg 14460 
cggatcgccg cgggcgggaa tgcacggcga 14520 
ctccgcaagg ctcatctcgt caactccggt 14580 
cttacctccc ccaaactcgg cgaggcacga 14640 
gcggtcggct tccccacgac gatcaatccg 14700 
gtcgacgtgg aactgggcac ctacaacgca 14760 
gaacggacgc gagccatcat gctggcgcac 14820 
atcgcggaga tcgcaaaaga acacgagctg 14880 
ggatccacct accggggacg gctgaccgga 14940 
tatcctgccc atcacatcac cagcggcgag 15000 
ttggctcgca tcatcgagtc gctgcgtgac 15060 
gtggacaaca cctgccgcaa gaggttcgac 15120 
gaccacaagt acacgttctc ccacgtcggt 15180 
gcacttgcgc tgagccagtt gagcaagatt 15240 
tggcgacggt tgcgcgaagg gctgtccggg 15300 
ccgcacagcg acccgagctg gttcgggttt 15360 
acccgtgccg ccctggtgaa cttcctggaa 15420 
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tcccgcaaca tcggcacccg actgctgttc 
gagcaggtgc ggtaccggat cgccgacgcg 
accttctggg tcggcgtcta cccaggcata 
tcaatcgctg aattcgtggc caagagttcc 
tttcagcaac gtgatcaacc tgcaccagcc 
cgcggaggtg ttcgcctcca actggatcgg 
cgaattcgcc caccacctgg gagtggatcc 
tgccgcgctg ttccttaccg tgcaggtgct 
tccttcgata agcttcgtgg cggcggccaa 
gttctgcgac gtcgaccccc ggacgttgaa 
cacgccggcg accaaggccg tattgctgct 
cgcgatcgcc gatttctgcc gtgaaaaggg 
ggtggcatcg tccgtgcacg gcaccgcttg 
tttcgatgcg atgaagatcc tggtcaccgg 
ggagctggcg caccgcgcaa gacgactcgc 
cgattcggcc aagtcttcca accgctggtg 
gctgatcggg aacgacatga cggcagcgct 
attcatcaac aggcgtagag aaatcgctac 
gggtgtcctc ctaccgccga cgctaccgga 
ggtccagctg gctccggaga tccgcgacca 
ctacacgagc taccgctacc cgcccctgca 
gctgccttct gcggagcacg cctgccgcag 
tgacgacgcc gaggtgcgca cggtggctga 
cagccaaaga tcaccactcc gaaagtgagg 
acctcctcgg tatacagccc agatcatgcc 
ggccgggact gggcagccga ggccggggaa 
gaagcacagt ccctactcga cgtcgcctgt 
gccgaatacg cgaaggtcgc ggggcttgaa 
agacgagtcc ctgaggtacc gattcacatc 
ccattcgacg tcatcacctg cctgtgcttt 
ctgcgacgcg tgacgcggaa catggcccgg 
gaaccctggt ggtttcccga caagttcatc 
cacggcgagc gggtgatcag ccggctatcg 
atgaccgtcc gctacacagt cgccgaaccc 
atcctctcgc tgttcactga ggacgagtac 
gcggaatacc ttcctggagc accgaacggc 
gcccggaaca acgacgcaag acctggctgg 
ccgacgtgac ccgggatcac ttcgatcgtc 
tctaggtgcc aaccaggtcg acgaagccct 
ccgaggacga tcccgcctcg aatctgcgac 
gtgatccgag ccgacgagaa ccactcggtt 
ggctcctgag cagccccaaa gcaccggcca 
ggatgcgcct acccgaccta cgcaaagccg 
gacctaagcc gccggtgagc cacggctggc 
gcggtaacgg cgcccgaagt acaggttgtc 
tacccacagg aacaggtgtt gttcggcgta 
ttgcaccatc gccgactgag cgcctcacca 
tccggtcggg ccgtcgtctg gcagcgtggc 
cggcgtgctg gttccgtgga agccgttaag 
gttgattttg atgttggtat cgctgagttc 
gaacgtcttc gacggtgagt aggctccgct 
ttgcaaggtc agggaagcga cgtggctgga 
gcgtagcaac ggcagcatag cgttggtaac 
caccgcccgg aggctcgccg gtgtgacggt 
gccggcgtta ttaaccagca catcgagccg 
agccgcgacg ctcgcgtcgt ctgtcacgtc 
cgcacgcaat ttcgccacgg catcctcccc 
gctccacccc agcgcgccga gcccggccgc 
ggtgaccagt gcaatcgtct gttcgctcat 
aaccaacacc atctaggtct cacaccgata 
acggatggag acgcgggagt tgcggtactt 
ccgggccgcc cagcgcctgg gcatcgccca 
cgagcaacga ctcggagtcg tgttgctgca 
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ggcggtaaca tcacccggca cccggccttc 15480 
ctcaccaaca gcgacatcgt caccgaccga 15540 
acggaccaaa tgatcgacta cgtcgtcgaa 15600 
tagcatccag catggcggca tctcggagga 15660 
gatcctcggc accgaagaac tcgacgcgat 15720 
gctcgggccg cgcacccgga cgttcgaggc 15780 
cgaacaggtc gtgttcctca actcggggac 15840 
cgacctcggc ccaggcgacg acgtggtact 15900 
cgccatcgca tcctccggtg cccgcccggt 15960 
ccccacgctg gatgatgtgg cgagggccat 16020 
ccactatgga ggatcgccgg gagaagtcac 16080 
cctcatgctc atcgaggact ccgcctgcgc 16140 
cggaaccttt ggtgacctgg ccacgtggag 16200 
ggatgggggc atgttctacg cggcggatcc 16260 
ctaccacggt cttgagcaga tgagcggatt 16320 
ggatattcgc gtcgaagaca tcggccagcg 16380 
tggcagcgtg cagctgcgca aactgccaga 16440 
gcagtacgac cggttgcttt ccgatgtgcc 16500 
tgggcacgtc tcgtcacact acttctactg 16560 
ggtggcgcag caaatgctgg aacgcggcat 16620 
caaggtcccc atctaccgcg cggactgcaa 16680 
aacactcctg ctaccactgc acccaagcct 16740 
cgagttccag aaggccgtcg aacaccacat 16800 
atgtcgcgcg tgagcgacac attcgcagaa 16860 
gacatctacg acgcgatcca ctccgcgcgt 16920 
gtagtccagc tcgtacgcac caggctgccc 16980 
gggaccgggg cgcacctaga gcgattccgt 17040 
ctgtccgatg cgatgcggga gatcgcgatc 17100 
ggtgacatcc gcgatttcga cctcggcgag 17160 
accgcggctt acatgcggac cgttgacgac 17220 
cacctggccc ctggtggagt cgcggtcatc 17280 
gacgggttcg tcaccggagc cgtcgcgcac 17340 
cactcggtcc tggagggccg tacgagccgg 17400 
accgggatcc gggacttcac agagttcgaa 17460 
accgccgcgc tcgaagacgc agggatccgc 17520 
cgaggcctgt tcgtcggaat ccgcaactga 17580 
gcaggccccg ggactcactc gaccgataag 17640 
tctgatgccg gcaagttttg accgtcgtga 17700 
gggccctagc cgcgatcgtc ggcaaggtcg 17760 
cacctcgccg cagctgttcc ggcagacgtt 17820 
cagcccggca cctgcggctg ggaggtgttc 17880 
tcggcgaaga actcgaccgg ctcccgcgcc 17940 
cacccgccat gctggcaacc gcccacgcct 18000 
ccagtgtcgc caggcaaccg cggcgcaccg 18060 
aatgaccgag acggcgaagg ccgacatccg 18120 
cccggtcgca gcgagcctct ccgacttcga 18180 
gggcacattc ccggcgtcgt cgaacatgcc 18240 
gagccgaatg gcgatcctgg caccgtctgc 18300 
gtcggtcgcg acgtagccgg ggcaggcgtt 18360 
cttggcgtac tggatggtga tcgcgttgag 18420 
gatcccgccg aggtcgacgc ccggcgtggt 18480 
ctggttgacg atccgcgggc gctcggagcg 18540 
ccgaacgacg ccgatcacgt tggtctccac 18600 
cgagggctcc tccggccatg ccccggcgat 18660 
gccggcgcgc tcctcgagca gagccgcagc 18720 
cagggatacc gcgaacgcat cgacgccgtc 18780 
gcgctggtgg tcccgtgccc cgattccgac 18840 
gatctcgtat ccgattccct tgtttgcgcc 18900 
gctttctagc gtggcccgaa accggcgggg 18960 
cccagctgat atgaatcagc ggtaggctcg 19020 
cgttgcagtc gccgaggagt tgcacttcgg 19080 
gccgccgctg tcgcggacga tcgcccagct 19140 
acgcaccagc cgcaaagtct cgctcaccga 19200 
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agccggggca 
gcgacgcacc 
cgcctccggt 
ggccgtcgac 
cgacgtggcg 
gaacaccgag 
tgtacggatg 
cgacggcgtc 
aatgatcgca 
cgaaggcctc 
gccgccccac 
aatcttcgct 
ccgcgcgcaa 
ctcacggccg 
tctgctccgt 
ctgcggatac 
agcggaagtg 
cgtcgcggat 
tcgagaacgt 
acgatctcat 
ctgacttccg 
tcgaagctgt 
tgcgccgatg 
gcaccgacgt 
gtcccgcagc 
gcgagagcag 
cgaccacgtc 
aggcctgccc 
ggccgatccg 
gggcgatcac 
acggattgcg 
ctaccaggtt 
cgctcccaag 
ccttcttcac 
cttcactcac 
gcaaagcaac 
gtctcgtgac 
aggaacgcgg 
gcgtcaggac 
gtcgtgctgg 
cgcctggtcg 
cgcccgcagc 
ggggcggccg 
cgcttctagg 
ctcggccagt 
ggagatctgc 
cagttcctcg 
ggcggcacgg 
tgacatgtga 
taaattccgc 
ggccggactg 
ctcaccgctg 
caccgccgca 
cgcgctgaca 
caacgtgacg 
cgtggatgcc 
cgacgggctg 
ccacggcggc 
gttcgacgcg 
gccgagcccg 
cgacgcccgg 
ctacgtcgag 
cgtgctgccc 



atgctgctga 
cagcgtgccg 
gagctgctgg 
ctgctgctct 
ctgttgcatc 
caacaagtcg 
gcggatgtca 
tatccagatg 
ctcggccgca 
gccgccgtac 
agccgctccc 
ccctcaattg 
cacctcatgt 
tcccgtcgcc 
gacgcggtcg 
ggcctcgtcg 
cccgatgcgg 
actccaacac 
gcggcaatgt 
cggctcgcaa 
agacgcacga 
ccgtggccat 
gcgcgcctgg 
ggtccagctg 
cagcgctttg 
ctggctcagc 
gtaggtgtcg 
cgacggcgag 
cacatcgcct 
ccagtccacc 
gctgtcgtac 
gaagatcatg 
ccttggcttc 
gaagtcttca 
actgaaccca 
cggcgagtag 
catgctaggg 
tcagacggtc 
tggtccaagg 
agctgcatgg 
atgtcgaacc 
tgtccccgcc 
tcgtactcgg 
tagcccacgg 
gcgggctcga 
agtgtctcct 
gcgagccggc 
tcgaggatcc 
cgaggatagc 
ttcacccacg 
gaaatcgagg 
ttcacggcgt 
ctggtcaccc 
ccgggggcac 
acactgccgg 
tggtccaccc 
ggcgacccgc 
gacacgatgt 
gtgttcctgc 
ctgcccgccg 
tacgcagtgc 
gtctccgacc 
atcgaggaat 



ccgaaggccg 
cgacgagcca 
cgaagttgct 
gcgaatccca 
aacccttcga 
ccattcttcc 
gctcactgcc 
gccccggcgt 
ctaccgtggt 
cggttctaga 
gagcactcgc 
cccgcagcta 
gcctcagcgg 
aggagccagc 
atgtagccga 
gcgcattggc 
ttggccctga 
cgcagcagga 
ggcgatattt 
ctgcggagcc 
cgcgacgctc 
gacacgacgg 
agaagatcag 
gagctcggag 
gcgtgcggcg 
acctcgtcga 
ttcaagaagt 
ctcatctcga 
tcgcctgcct 
tgaccgtccg 
cgcagctcca 
gtggcgagca 
ccataccctc 
gaccctggtg 
cgggactctg 
cttttcgctg 
cgtactcgcc 
tggtcctcgg 
cccgttcgat 
ccgcgttcag 
cggcgggcag 
agcgcgacca 
tgagcgccgc 
cgttggcgca 
ccacccggtg 
tggtgccgaa 
cgatggtgat 
ggctcctggt 
gtacgcgcgt 
gaccgaagga 
cgggcggcaa 
ggaaacccgg 
acctgcaccg 
cggtgctccg 
ccgagcgcga 
gcgacctcgg 
agctgaactg 
tccacggcta 
ccgccaacgg 
cgatggaccc 
cgatgcacta 
cggagaatga 
ggctggacct 
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ggcgatcctc ggcgcgctgg cagcagccga 19260 
gccctcgcta gtcctggctg ccaaggccgg 19320 
cgacgcgtac gccgccgagc cgggagccgt 19380 
gccccagaaa acgctgcatg acggccgggc 19440 
cccgacggcc gaactcgaca tcgaaattct 19500 
gacctcgcat ccgcttgcca gcgagcccca 19560 
ggatctcccg cttgcgcgct ggcccggccc 19620 
ggaagtacgc aaccagacgc aactgttcca 19680 
catgcccgaa tccagtcgcg tcaacctgct 19740 
cgcgccggac gtgacgacag tcatcgcctg 19800 
cggcttggtc cgcgtggcca cactcctcta 19860 
ggacgaatgc cgtgcgactg ccggaaacga 19920 
gagaagacgc tatgcggatg ccgcagagac 19980 
ggtcccgttc gagctggtag gtttcggcga 20040 
agctggttgc gagcattcgt tgcgtggtcg 20100 
ggcggatctc gtcgaggcat cactacagga 20160 
tcagcgcgtt gatggaaatg tttcgccagt 20220 
cccggcgaat atcccaacgg aatctcgaat 20280 
tcctcaacac tcgtccgtag aatgctgagg 20340 
tcggtaatca ggtgcgtgta gacgtctcgg 20400 
gatgccacgg ccgcctgtag cttcgaggaa 20460 
tactgcggcg ggagcagaag cagcgcttcc 20520 
caccttcggc ctgaccgagc cgggcgcacg 20580 
acacagcgtg gcggtgttca gcgccgccag 20640 
cttcgggaga gctgggccat tacaggttgt 20700 
cgttcatcca ccggtgttca tcaccaggtt 20760 
ccgcaagctg gtcagcattc acctcgaagg 20820 
tcacgaccaa ccccggaaaa ccccgtcgag 20880 
cggcgatcag cccgtcggcc agcaggtcgc 20940 
tgcctacgtt gagcttcatg gagatctcgt 21000 
cgccgactgg cgcgatcaca cccgccgggg 21060 
ccgtgccgtg atcgtttgcc atgatgtcca 21120 
ataacgaagc atgcccaacc cgggttcata 21180 
gcagtccccc tcgggctccc agcgaggaac 21240 
ggcccggaac tcccgaagct gacccaaggt 21300 
cgccagtcaa tcaatgacgg ctccgggaat 21360 
gaaaaatggg atggatacgg gctgaccagc 21420 
gaaagaatcg ggcgggtttg tctggcgcgg 21480 
cgcgcggcgg gcccggccgg cggccgcccg 21540 
tgccagcccg gtggcgacga tctcgaacag 21600 
ctcgccgttc tccaccgccg cggtcaggtc 21660 
cacctcggcc accgcgtcgc ggacccggcc 21720 
ggtcatcagg cagccgccgg gcagcagtgg 21780 
caccgcgcgc agccgccgca gacccggcgg 21840 
ccagaagtcc acgaatgcct tgtccagcgt 21900 
gtgcttgtgc accccggact tgctcatctc 21960 
gccgtccagc ccttcctcgg atgcgatctc 22020 
ggcctgtgct tcggccgctg agcgtcgcgg 22080 
tcgctattga tttagtgaac gtgcgtacgc 22140 
gtgcgagatg cgggtgcgac gactgggctg 22200 
gcgactggtg atcgactacg tacgggacct 22260 
cgagaggctg gcggtgccga gcgggacggt 22320 
ggaccacacc gacgcggccg cgctcgcggc 22380 
acccgcgccc ggccacggcg acgacgtgga 22440 
gctgacactg caccgactgg ctgccgaggt 22500 
gccgttccgc gtcaccgcgg tccccgccgt 22560 
ggtggtgcag gccgacgggc agcgggtctt 22620 
ctggtggctc atcgcgcgcc ggttcagccc 22680 
cgcggtggtc gacgcgccac acctccagcc 22740 
gaggcaggcc gccgcagccg cagagatcct 22800 
cgaggcggaa cagccggaca agatcgcggg 22860 
gttccgcacg cacgccggac accgcgcaca 22920 
ggccacttga ggcggaaact ccgccgcgcc 22980 
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ggcgggctca ggtcgcaaca gaacctaggt 
cggttgctcc aagacactct ccggtcgcgg 
actgctggaa accacgatca aatgcgacac 
cctggtcctg ggcgggacgg acccttgggt 
cccacggcgg ccgaagcgtc ccggtagcag 
gctggctgcg gaccgggccc agaggcgagc 
ttgtagatcg tcgagcggaa gacgcaccgc 
cccgcccgag ataccggtta gcgtggtcag 
gcatatgtcc ggttaccatc gattcgatgc 
cgacgggtac ctgaatgccg cgctcaacga 
ggcgagcgcg ctggcggcgg agttgggcga 
ctcggggctc accgccctgt tcaacgaaga 
cgctctcgcc aagcatcggc tgattggcga 
gctcgcgttg gcagtcgcgt tccttggtga 
gtgccctgcc gtcaaggaga cctacagcga 
gaagggaata gcagacctac tagcgggcaa 
gagtcttcgc ctgcggctcg gcttccatga 
tctggcctgc atagccagcg cagacggtga 
gtcccaaacg atcaaaaaga acgtcgatgc 
cctacacgac cactacgagg cagtagcacg 
aggcgttcac cacgccacac agctcgactt 
gggcaaagcc ggggaaccag caagctccag 
tccgcgcgag cgagaggtag cagcgctggt 
cgccatcatg gtgatctccc gccgcaccgc 
actcggcttc accactcgga cgcaaatcgc 
acgatgcgac tgacaaccgc cggttttcgg 
ctgcgagacg gcccctctcc acagcctctt 
cgcttgagga cggcatccga tgcccaccgc 
tgatcgagaa cgctgccgtg gtgacgatgg 
acatcctcgt cgacaacggt gtcatcgtcg 
ccgagcgaat cgacgcctcg tcgatgatcg 
acacctggca aaccgggctc cggggcattc 
ggggattccg gctgcagatg gcgacgaagt 
acctcggtgg cctcgattgc cttaactcgg 
acatcgtcac cggcgagcac gcgcacgccg 
gcgcgctgta cggccacggc ctcctgccca 
gcgggctgga cgagtcggtg gaagaaggcg 
aaatccgcgc ccagtacttc ccgagcgagc 
aggaactagc catcgctccc ttcggggacg 
tcggcgcacg catcacgttc cactgcaacc 
tcgaggcgct gcactcccac aacatgctcg 
tcagcactga gaacgaatgg cacctgctgc 
ccgagaccga gatgcagatg ggcatgggat 
cgccggggcc aagcctcggc atcgactgca 
acgcacggct cgtcctacag gtgacccgat 
cgacactgcc cacggtcatg cggtggaaaa 
acggtgcccg cgccgcgggc gtcgacgacg 
ccgacatcgt attgctggac atgtcaggca 
cttgtggcgc gatcattgcg cagaccaact 
ggcgggtggt caagcggaac ggccggctgg 
cgctcgcgga gtcgcacggt catctctacg 
cgcaacctcc agccgagcta ccggtgttca 
cgcgccattc gctgcgtgag gtgacccggc 
gctgctgtgt gacctgcacg gacgaggtta 
cggtcgctca tcatgccgcg cccaccgtca 
cggccgggcc cctggaatag ccgcttccac 
cgatgatcgt ccccgtgaac gcgtggggct 
gaaggcgggc tgcgggaacc ggcgttgact 
ccacatcaat ggggagggcc cggtgacgac 
gtccgaacgc accgtgaccg ttgaacggtc 
agcaactcga cgaacgcgcc ggcgatcttg 
gacccggctc aacacatcaa cccagccccg 
cgtggcaatc gccctaaagg agggccagct 
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ccgatggatg gtccggaatc gcgcgggaat 23040 

cggcgtcaat cacaagattc agcatccatt 23100 

tccgggttga gccgaaccca cgggtccagt 23160 

tcccgccgag gaaccctcac ttgcgctgaa 23220 

acgccggaaa ccagcgctta gctctggaac 23280 

gctcggtggg tgctgagttc ggggacgtac 23340 

acgcttgagc ttgaacggcg cagcacgcga 23400 

atcaacactg cccgttgtgc accattagcc 23460 

ggatccggta cgtgtcaaag ctctctacgt 23520 

cgcgtcatcc gggtcagccc ttctcgagca 23580 

ccaatcaggg gcggcctacg tcgcgcagat 23640 

tccggcgcgg gcgagcgcgc tcttcgaaga 23700 

tcacgctgcc gccgcctacg atcagatcga 23760 

tcatgagcgc gccgctgccc tgttccgcgg 23820 

acagtggatg agatcgcttg ccctgtgggc 23880 

ttaccggcaa gccaatgctg cggaactgga 23940 

gcagtttcag atcgcgttgt gcgttgagat 24000 

cccggcacgg gctgcgacct tgttcggcgt 24060 

gtcgctcgcc ggccacaaac atgttgcgcg 24120 

tagctcactc ggcgacgaga cgtttcagaa 24180 

cgacgaggcc atcgcatgga tacggaatcc 24240 

gacggaagaa acgcacgcgt cggtgcttac 24300 

cgcacagggc accaccaaca aagaaatcgc 24360 

tgaggcccac gtcgagcaca tcctgaccaa 24420 

gacctgggtc attcaacaga agaccgcgcg 24480 

attggcgtgg accagtagaa gctgccccgt 24540 

ctccacagtc gatcaacgac cgaacgggac 24600 

gcggagagga gcacttggcc atgcgcacac 24660 

acgacgttct cggcgacttc gaacgagccg 24720 

agatcgggcc gaacctcaac gttggcgaca 24780 

ccatgccggg catggtcgac acccatcgac 24840 

tggcggacgg gaacatcctc gactacctga 24900 

accgcccgca ggacatgtat gcgggcaact 24960 

gggtgaccac gctggtcgat tactgccaca 25020 

cggtcgctgg actccgcgac gcgggcgtgc 25080 

tcacgtcgaa cacgtggtcc gagaccaaag 25140 

acttcgcccg gcgggcgcgg ctggcccggg 25200 

aacaattgct gcgcttcggc atcgccccgc 25260 

tcaagcagga attcgagctg gcccgcgagc 25320 

aagtcatcgc gcgcaacctg ttcaaggaca 25380 

gcagcgacct tctcttggtg cacgggacat 25440 

ggggtaccgg caccatgatc tcggtttgcg 25500 

tcccggtcat tcgtgaggcc accgagaaca 25560 

cgagcagtac ctgcggcgac atgatttcgc 25620 

ggagggacga tcaagaagac tacgcgcggt 25680 

cgcgcgacgc cttacggtgg ctcacggtca 25740 

tgacaggaac cctggcaccg ggcaagcggg 25800 

tcagccaggc cggttggaac cgccacgacc 25860 

cgggcaacgt gcacacggtg ctggtcgatg 25920 

tgcacgtcga cgtcaacgga gcgcttgcaa 25980 

accaaatggc ccagcacggc ggattcatcc 26040 

accgctgaga accaagccag ctgtggctgg 26100 

tggccgggct cgaccggaaa accgtcagac 26160 

tcggcatagg caaggccagt cgcagcagca 26220 

gccgcgtgcg agcgccgatt cgagcaacca 26280 

ctgcacggag caggtattcg cctcgtggac 26340 

gccgtgcggg aacgcgccgt cttccggggg 26400 

gaccactcgt gcgcattcga gcatcagcgg 26460 

aaagaccgtc aggtgtggtg gcaggacgcg 26520 

ggccgagctg tcgtttcttg ccgcagcaac 26580 

ccgcgccggg gccaccaacc gcgaactcat 26640 

ataacagggt tcgagccctg cctgtcgact 26700 

gtggttacgg atcgtgttcg cgtgctcctt 26760 
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tgtgaagcgg gcgtgagtga gttcatagag 
acatggagaa cggcggaccc agccggacag 
accagatcgc tgaccgacca cagatcttca 
tcaccgcaga ggaactgacc aaattggcca 
caatcgatca gccccgccgc ctgttcttcg 
tggccgcggc catcgccgat ggtgtacggc 
ccttcgccta tcgcaatccg cgcccggact 
cccaagcatg gaaacgcgaa cgccttgcta 
ccttcgtacc ggtcgatttc gaaacccagc 
tcaaacggac ggacccggca gtattcgtgt 
acgccgccca cgccaccctc gaatacatcg 
tcgactacct gcagcctgcg gccaccgacg 
atcggctggc cggcgccggc gaacccttgt 
cacagcttca cgccctcggg ttcaccggca 
ccggctacct ccacggatcc gcaggtttcg 
gccgcatact gcacgcgagc cgctgaggtt 
tcgcgacggc tagcatctcc cacatcgtcc 
gagcgcacgc gggtagggcc ccggcgaaca 
tcgcgtctcg ataggctgcc aatgcgccgt 
ggtcgtggcg tagtcaagga accgctgcag 
atcgccctca taggactgga gatgggcgag 
gtactcgtcc ggcccgaacg acgcctcgca 
gaggaattcg ttgaacagcc tgagactgcc 
atgccgttct ccggtcggtg attgcgacca 
cggatactcg ttggtgaact ccgcgaaccg 
gtgcgggaac attgccttga tcaaaccgca 
caccagcagt cactggagcc ggcgttgcgt 
tgcacagcaa cgttgttgaa gcacaacgcc 
ttccagccgt ctcggcgcgg ggacgttcgt 
gtcggtttgg gcggttttga gtccgtgcga 
aggaggtgcc gtggttacta cgcagcggaa 
tctgccgggc gtaacggccg agttccaccg 
tcttgctagc gcggctgagg ccgttcgggg 
ctatcgtggt cttggagcgc tgaccctgtt 
gattggcgtg ggtgtcgcgg tcgcgcagcg 
tttgtagcgg gttccgggct aggcgtcgct 
gaacctgggt aggtgttgcg ttgcctgcgt 
gaagaacccg agcacacagg cggccttgaa 
gcgggtcgaa cagcgtggcg aagccgtcgg 
ggacggatgt gcggctgttt cgaggttcgc 
cacggtatct gtgcgcgcgg gaggtagtgg 
gcgcaggcgc gcggcaacaa ggggcgcggt 
tggcacagtt tcgtcgggtc gatgttcgca 
cggtacgggt cggttggggg ctcgaatggc 
cacgggcgtc ggtcggcttt gcgtcggaca 
gcgcgttggc ctcgcagatc gtggcgccgg 
tggggcgcgg gcggcgggcg tggtcgtcgg 
ttgaccagcc gggggctgag gaggctggcg 
cgggtgttgc gtgggtcgag atcaacgctg 
cgggcctggt tggggtcgct gagctggtgg 
gtttggacga ggaaccgttc gctgcggtcg 
tgatcgctga ggcgacggtg ctgggtgcga 
cccggatgct gccgctgccc gtgctgccgc 
agtcggcctc ctgggtccgg gggccgctgg 
tgttgttcgg gtttagcagt gcgttgggga 
tggcggaatc ggggtaccgg ttctgcctgt 
gggcgcggtg ggaagccgga acgggggcgc 
gggttcggcc gcgtcctgtg ccgttgcccg 
cggctctggg tgggccggcg gcgttcgcgg 
gtgcgcaggc gatggtggtc gcgggcgcgc 
tcgcggcgca gctcgggtac gggctgtcgg 
tgttgcggcg gttggaccgg gtcgacacgg 
gtgctcggtt ggtgcatgac gtgattcccg 
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cccgcgccgc gttcgtgagt aagttcttga 26820 
cgctcgctac cgcgtacgcc cgcgcctatc 26880 
ccgaccccct ggcggtacgt ctgctcggcg 26940 
cgcccacaac cgatcacctc ggcgccgggg 27000 
ccgcccgtgc ccgcttcgcc gaggacgccg 27060 
aggtcgtgat cctcggcgcg ggcctggaca 27120 
tgcgcgtctt cgaggtcgac caccccgcca 27180 
gcgcagggat cgaccgccct gagacgttga 27240 
cgctggcaac tggattggaa tccgccggat 27300 
ggctcggcgt tgtcttctat ctgaccgcga 27360 
ctggtcaagc ccagccggtc gaggtggtct 27420 
aggaccgtgc gcaccagcgg gcacgtgccg 27480 
tcagctactt cacaaccgac gacatcgccg 27540 
tcgaagacca ctccgccccg gacctcatcg 27600 
agggcgaacc accccgcgca ctgcgcgcga 27660 
tgcctgccgc gcctacttgg accacatctg 27720 
tctgaactgc acaaactcaa aaaatcgcgg 27780 
agccctagcg atacatcact ttgaacaagg 27840 
tgcgtttcgc gcgaaccgcg cggtcgactt 27900 
ctcatcgcgg gtgaacgggc ggtcgtccgg 27960 
tgagttccac tcgtgcacat tcgccaccgg 28020 
cgccagcgcc cagccatacc ggctgtcaca 28080 
ttggtaggcg cggattgtgg atggcgcgag 28140 
ttcgtccaca tgcgacggtg tccaccgcca 28200 
gcgcaccagc cgctcccgca ccccggccag 28260 
gtccagacga tcatgatcaa cgagtgtgat 28320 
ccacacggtt ggccaacccg agcaggtggg 28380 
tgactggttc aggtccggcg gcagcccgta 28440 
gttcagagat aaaatcgtgt tgcccggttg 28500 
ggccgattat gcggcatggg gcctgtgttg 28560 
accggcgggt gctgggggaa cgacggtgag 28620 
tcccgatcac tacattccca ctcgtgatga 28680 
cgttctcccg gaccatcaga cgatggcctt 28740 
ggggttgatg gactggccgg tggcggcggc 28800 
tgccggcgga ccggcgtcga agtgatccgg 28860 
gagcgcgccg cggcggcgct caggccaggt 28920 
ccggcgtagc tgcatgctcg gagccgtttc 28980 
ttcaccaccg ctgggactac tggtgtgtcg 29040 
tgttgctgat cagctcgcag gccaccgaga 29100 
tttccgctcg tcggtcctac gctcgtgact 29160 
tggctaggac ggcgcggaac acaacgcctg 29220 
tgcgagccgc ggcggtcagc ctgcccgtcg 29280 
tgccgggtcg ttgggaggac gcggttcgcg 29340 
cggttgcggg ggcgaccggc gcgggggtgt 29400 
gctgagtgtg ttcgatgctg cgtaccttgg 29460 
tggtggctgt cgcgcagagc atcgcgggct 29520 
aggggcgtgc gtacgtcgag gtgcgcggtg 29580 
agcagatcgc ccggcggttg gagaaggtgc 29640 
tcgtggggcg ggtggtcatc gggcacgatc 29700 
gtgtggtgga ggacgcggag gcggacaccg 29760 
gtgtgagcca cccggacaat ccggtcaagg 29820 
acttgaccgg cgctgtgatc gcggtggggg 29880 
cgtctgttcc ggcgctggtg tcgaccggtg 29940 
aaacgcggat cgggaagacc gccactgacc 30000 
acacgcttgc gcggcgtccg gcgagtttgc 30060 
tgcgggagac gcaggcggcg cggcggagct 30120 
atcctgccag tcaccgtagt gcgccggtgg 30180 
ccggcccggt cgagcgcgcc gcgaacggtt 30240 
gcatgctcgg ggtgacgcgg agttttcagc 30300 
cgcgcgcggc gcaggtgggg cggggcgcgt 30360 
agcggacgac tctggtgttc gatgcccggg 30420 
tggtgatcga cgcggacgtt ctgcgcaccg 30480 
cggacggggg gcgcgcggtc gtcgagttgt 30540 
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gggaacgtgc gcatgaactc gtgggtgtga 
ggtcgctgta ctcggtcgct cggtcgcggt 
ggggtatcga ggtgttcgcg ctgcgccacg 
ccgaggagct ggatccgctc gctgaggagt 
tggttctcgc gggtgatggg ggccggttgg 
cgggtggggt gcggctggct tcggtggtgc 
gtgtggtgtc cgcgcgggcg cgggcggcac 
cgcgtggtgg tgaggttccg tggggtgcgc 
cgtgtgcgtt gttgagtacg gtcccgaccg 
tgtcgttgat cggttcggtc agcggggcgg 
gccaggtccg tgcgggcgtt ccggtggccg 
cgtggtgggg gatgcaggcc gcgtggcgtc 
ggcatgcgat gggccccggc atggtgctgg 
ccgatgatgc ggcgcacgag cgtttccaag 
tgggtgtgtt gcgggcgtcg ctggaggagc 
gtggggccgc gatctcggcg agtatcggtg 
tgctggggat gaacgcgttg atcggcggcg 
acaggcttct ggaggccagc gttgttcggg 
agttgccggc tgacgagctc gtgctcggcg 
ttccggccga ctgccggctt ctggaggccg 
ccggggagtc gatgttggtc accaagacgg 
accggacgtg catgatctac cgggggaccg 
tggtcgcgac cggcgcgaac accgaggcag 
gtgccgatgg tgtcgcggct cgcctgcaga 
tgggagcggg tgggattctg ctcgtcacgg 
cgctggggcg ggcggtgagc ctggcggtgg 
ccaccgtcgc cgaactcgcg gccgcccggc 
actccgcgac gatcgaggct ttggggcggg 
ccctcaccga agggcgcatc gcgctgcggc 
tcgaggaaat ctccacagtg gaacggcgag 
tggtggagaa cggagaactg ctcccgcatc 
gtggcgcggg agtgcacccg gccgaaggca 
ccttcgagcc ttcccgtggc taccacgcgg 
tgagtgtgaa aggtgccccg gagatcgtgc 
atggcgacgt gccgttggag gagaccgctc 
tcgcccggca gggctaccgg gtgctggctg 
atctcgacga gtcccggatc cgggatctgc 
cggtgcggcc cactgcggcc gccgccgtcg 
tgatggtcac cggcgaccac cccagcaccg 
tcaacgagcg tcgcgtcatg accggcaccg 
cggcggaagt gaccaccgtc gcggtgttcg 
tcgtcgacgc cctgcaacgc aacgaccgtg 
acgccccggc gatccggttg gccgacatcg 
ccgcccgcga ggccgccgac ctcgtggtca 
ccattgtgga aggccgcgcc atgtgggcat 
gcggcaacct cggcgagatc ggcttcaccg 
gcctcaacgc ccgccagttg ctgctggtca 
cgatcgcggt gcgaccaccc ccgacgatca 
aagcctcgct gggcgctgcg ctgacccgcg 
gggccgccat ttgtgcgtgg atcctgggca 
cggtcgcgct cgtggcgctg gtcgccgccc 
ggacaccact ggtcatggcg tgtgcgcttg 
aaactcccgg cctgagccaa ttcttcggag 
tcgccgctgg cgcaacgacc gccgccaccg 
cctccatgca tgagcgtcaa gatcccttca 
aacttcgagc aggcaagacg ttggagcgcc 
agcgcctgcc cccagccgac gggcggtgga 
cggtcggtca tcaggatctc tgtcgtgagc 
taggactcgg cgatcactag gtcgtagtcg 
ggctcggctt gaacaatcac cacatccagt 
acctcgggga acaggatgaa ggtgacggtc 
atctcgccac gtgagcgatt gatctgagtg 
tccgctgcct ccacgacgag ctcacctttc 
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ccaagggggt cggcgcgcag cgtgatggtt 30600 
tgcccccaga cgtgcgcgag gtggccggtc 30660 
attcgcgcac ggccgcgtgg gttctggctg 30720 
tggtcgcggc agctcgcgag gtcgggtcgg 30780 
cggggcggat cggtgttgat cgcgtggtcc 30840 
gggagatgca gcgggagggc agcgtggtcg 30900 
tggcggctgc ggatgtcggg cttggtgtga 30960 
acctgatctc gggcgcgagc ctgggggagt 31020 
cgcgttatgt cagcgtgcgt agcgcgcagc 31080 
tgttcggcgc gttcgggcct gccatcggga 31140 
tgtgtgcggc aatcgcgctc ggcgctggga 31200 
cggttccgcg cgcggtgccg cgcactgcct 31260 
agctgctcgg cagttcatcg cagggtctga 31320 
gtcaggacga ggggacgggc cttcgccgga 31380 
tggcgagccc gttgacgccg gcgttggcgg 31440 
ccgttgcgga cgcgacgatc atcatgggcg 31500 
tgcagcgcat caccgcggac cgcgctttga 31560 
ttgtggtgcg ccgtaaggaa aatgtgacgc 31620 
atgtggtgga gttggtggcc ggggatgctg 31680 
acgggctgga ggtcgatgag tcgagcctga 31740 
tgacggcgac tccggcggtg ccggtcgcgg 31800 
cggtcgctgc cgggcgggcg gtgggtgtcg 31860 
gcagcgcgat cgggcccgat gcccggcgag 31920 
cgctggccaa ggtgacggtg ccgatctgcg 31980 
acatgctgcg taggcgtcca ttggggacat 32040 
cggccgttcc cgagggattg ccgttcgttg 32100 
ggctgtcgaa acgcggggcg ctggtgcgta 32160 
cgcaggtgtt gtgtttcgac aagaccggca 32220 
aggtatccga tggcgcgtcg agccggtggc 32280 
tgctcgcggg tgccttgcgg gcaagtccgt 32340 
ccaccgatcg cgcggtcgtc gatggcgcgc 32400 
agcagcgctg ggtaccgatc gacgagatgc 32460 
tgctcggcac ctggcgcgac ggcaatctgc 32520 
tggcccagtg cacccggtgg cgcagatcag 32580 
gtcgaggcgt cgagcgagtt gtcgaaaacc 32640 
tcgcggagcg ggccgcctcg gaccgcagtg 32700 
ggctgatcgg tttcgtcgcg ctggccgacc 32760 
cgcagctcca gcaggcaggt gtcgaggtcg 32820 
ccgaggcgat cgccgccgaa ctcgacgcgc 32880 
aactggacac cctcgacggc gaccaactgg 32940 
cgagggtcag cccggcgcag aaggcgcgca 33000 
tggtcgcggt gaccggggac ggcgccaacg 33060 
gcatcgcgct gggcgagcgc gcaacccccg 33120 
ccgatgaccg catcgagacc atcgtcgacg 33180 
cggtgcgtga ctcgctggcc atcctgctcg 33240 
tcgtcaccgg tgtgatcaac ggcgggggca 33300 
acctgctcac cgacatcctc ccggcgatgg 33360 
ctgcggagat gctgctggca gaaggccccg 33420 
acatctaccg ccgcgccgcg gtcaccgccg 33480 
gggcgaccgg aaccctgcgc caggccaaca 33540 
agctcggaca gaccctggtc gtgcgcggac 33600 
tctcgttcgt ggcactggcc gctgtcgtcc 33660 
ccagcccgct actcccacac ggctggatga 33720 
tcatcagcct cttgctccaa caatggtgaa 33780 
ccacagccat cgtcgggccc gactcgggac 33840 
gagagcggtg aacgactcgc cagcggcgaa 33900 
cccacaatgg gagctgcggc gacgtgcatg 33960 
ccgtccggat agatgaccgg ccactgcggg 34020 
tcgacctgga gaccctgaag agccgcatcc 34080 
cctggacgcc gccggcgtag acgagtcagc 34140 
tgcatggagg ccacacgcag acgttgcgag 34200 
cgcagcgtct ccagctcagc gatcacccgg 34260 
gccgtcaggc ggacgcgacg gccttcgcgc 34320 
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accagtagcg gagcggatgc cgcccaccgt 
cagcagtggc ctgaccagtg ccaacatcgc 
gtggtacagg ctgggcaggc cagcgggcaa 
acatcggccg agatcgtaac gaaccccgtc 
caagctgtaa gtgcgctgcc agccatgacc 
gtgcggtgag tcgtgtctgg ggttgcggtt 
acgagctttg tggggcgccg tcaggcggcc 
cggttagtga ccctgactgg tttcggcggg 
gcacacgacg tgcggcgggc cttctccgcc 
cgcgatccca tgctggtgcc gcaggcggtc 
acacgtgccc cggaggccgt gctcgtcgat 
ttggacaact gcgagcacct gctgggggcc 
gcggccccgc ggttgcggat acttgcaacc 
cagacctggc cggttccgcc actgtccgtg 
caggaaggtc gtccctacca gcgtgaagcg 
gcgctaccag ggttcaccat caacgaagaa 
aggctggacg gcattcccct agcgatcgaa 
gtcgggcaaa ttctaacgcg aatggagaac 
gccgacgatc ctcggcatca gaccctgcgc 
aatgagcatg agaagcagtt gtgggcgcgc 
gacgcagccg aaaacgtctg caccggcgac 
ctcgcgggac tgatcgacaa gtccgtgctc 
taccggatgc tggagactat ccgtcaatac 
gaaaacgctc tgcgctgccg gcaccgcgac 
gccgagtccg cgggcccgca tcaagccggt 
aacttctggg cagcgttgga ctactgcttc 
cggatggtga gcgcgctgtg gttctactgg 
atgtggctgg acagggcgtt ggccctggag 
ctgtggatca ccggatggat tgcccaccgg 
cttaacgaaa gccgcgagct ggcccggaaa 
actcagtttc ttggcgaccg ggagatgtgg 
ctcgacgagg cactggcccg ccaccgcgca 
attttctcaa tacgagcgca ggcagcgggc 
cttctgcacg aatgccgcaa gatctgtaca 
gcggagtgga acatggcgat cacttggtgg 
tccgcgtcca ggtccctccg caataagcgg 
tgcgttgaac tgctggcctg ggtcgccgcc 
cttttcgggg cgttggataa aatgtgggaa 
acgctactga cctggcgtga gcaggccaag 
gaatacgaaa ccgcccgagg ccaggggacg 
gctcttgggg agaagccacc gccggtcgaa 
ccggttctca caaaacgaga acgcgaggtc 
aaagagattg cggctgacct ggtcatttcc 
atcctttcca agctgggctt tacctcgcgc 
agataacgta gtcacctgct ctgttggccc 
gatccgttcg cggcccggca acaccgcccg 
acatccgctt cagctaatcg tccgcggccc 
tccatgatcg gaggctggcc attgccgatg 
tagttgcggc cgcccacata ggacccgttg 
ggccagcttc gtaggccttg gctgttgccg 
gtctcccagc accatcgagc caccgttcgt 
gcacacgtcg atagcgggct cttgagccaa 
gagtccgcga ccagcgagcg ccccggcgac 
cgccacgtct gccgatgccc ggttgtcctt 
agctttcgtc atcgagcctg tcccacagtc 
gccgcccgtg cgaaggccaa cgaccagacc 
tcgccacgca cagcctcgat gaaccgtcga 
cccagatcga tcttcctgga cccggccgga 
atcgactgga tacccgtgcc gatgagcgct 
ggggcatgtc cggtcacgaa gctttccacg 
gcgcttccgg agattgtctc atcgattccc 
gccacgaccg ttagcggatc ggcattgagt 
gtgaggtcga gtgcgactcc ggcgcccgct 
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gccgccgcag ctggcccgcc gcataaccgc 34380 
ccacacgggc ccggtcggcg accttgatca 34440 
cgcacgccct ccccgcaccc accacgatca 34500 
ggtgggttcg aagtggttcg aacagttgtt 34560 
gggctacaat aaaacgctag ttctaccgag 34620 
cgggaggctg gtcgcctccc atatgaggtg 34680 
acaagagtta agcgcgcgct gtccagttca 34740 
atcgggaagt cccggctggc gctgcacgtc 34800 
gggacatttc tggtggagct ggctaacgtt 34860 
gcggctgctc ttgggatcca tgaccgttcg 34920 
tacttggcgg acaagcaagt cttgctggtg 34980 
tgcagccgcc tagtcgctgt gctgctggcc 35040 
agccgagaac atttagggat agtggcggaa 35100 
cctgctggca gcgattcatc aacgtcacaa 35160 
ttgaccttgt tcgagcagcg tgccgcggcg 35220 
aacgaacaga tcgtggccag gttatgccgg 35280 
ctcgccgccg tgcgcgttcg cgtgctctca 35340 
cgattccggc tgctcaccac cggcgaccgc 35400 
gccgccgtcg actggagcta caacctgtgc 35460 
tgttcggttt tcaccgacga attcgatctc 35520 
gggctgaccg atgatgacgt cttcgccgct 35580 
acccggaccg aggagcgcac gaagacgaga 35640 
gggcacgacc ggctcaccga ggcgggtgcc 35700 
tactacctgc atctcgccga gcagtccgat 35760 
tggctcgcgc gtctgcgcgc cgagcgcggc 35820 
accacacctg gcgaggcgtc ggttgggttg 35880 
gctgctggcg actatctgag ggagggccgc 35940 
accgaaccca ccagccagcg tgcccgcgca 36000 
caaggggacc gagatggggc gctggtcctg 36060 
ctgggcgacg aaacggaatt tacctacgcg 36120 
gacaacaacc tgactcgcgc cacgcaactg 36180 
actggccact ggaccggccc cgccttagcc 36240 
ctgcttggag accttgaccg ggcgatggtc 36300 
gaacttggcg agcgctggag gctatcgtgg 36360 
gccgcgggcg acccaaccaa ggcagccgcc 36420 
gaactcaacg acctactcgg gatcagcgac 36480 
gcagaacgca atcctgaacg cgcggcgatc 36540 
ctgattggca ccccgctgtt cggatccgag 36600 
gcacgcgccc aggaagcact cacagacacc 36660 
cggatgaacc gggagcagac tatcttctat 36720 
gcggccgccg cttcggctcc ggaaaccgaa 36780 
gctgccctgg tcaccgctgg caaaacaaac 36840 
cagcgcacag ccgaggccca cgtcgagaac 36900 
gcacaggtca tcagttggat gaccgggcag 36960 
atgcccttct cgggccagcc cggctcacat 37020 
tcgtggcaag gtctgcgata ccgaaagcgg 37080 
ccgcaacggc gcatgttgaa gtaggtatgg 37140 
gccatgaccg cggagttcgg gtgctgcatg 37200 
gcgagcttac gcggtataga ttgagccacg 37260 
ccgacgagta gtgagtagcc ggtgtgaacg 37320 
ccggcttgta gtgcccgtgc gcgaagcctt 37380 
aaactcgtcc cgttccaaca cgagcacatc 37440 
gatcagtcag cctccaccgg acccgacgat 37500 
gtgcatgatc tgttgtgcct aactgcgccc 37560 
tgccgatgcc cggtccgagc agcctccagc 37620 
ccgtcttggc cggaggccgg gggctcggcc 37680 
acacccacga cgtagaggtc ctcgcgggta 37740 
ccgacgtgtg tcagcgtgcc gaccggtgac 37800 
ccttcggttc cgaaaacctc caggcatgtg 37860 
aatgaagcat gcacaccgct gacgaactcc 37920 
accctcgtca tgtggccgga cgaggtatgg 37980 
acgaaccgga gacagtccgc gtcgtgcacc 38040 
gccgaatcgg tcgtccgcca tctccgcgcg 38100 
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gcgggcggca gtcggatcgc gttccggacg 
gcggcgattg catcccggat caaacgatgt 
gcgaacacta cgcctgcctt ctccgcggca 
agggcaagcg gtttttcgca caggacgtgc 
tcgtgatgtg ccgagttgat actcgagatg 
gccacggtga gttcgtcatg gccccgggcg 
gccaggtcgc cgctgacgac gctgaccacc 
cctacaaatg cgcgggcgat gtcgctggcg 
gtcatgattc gatgggttcc tttcgggaat 
acagacgtga ggccagcgcg ggtggccgcg 
cttgactgtt agcaagatcg gaggtttcgc 
ctgtcctggt agtgccaggt cgtcaggatt 
cgtctcccga gccggccttc cggatcgaac 
ccgtagcgtc ggcgtagctg tttccaggga 
acgacctctt tccagagata cgatcgtagg 
ctgaagtacg ccttccaggc agcatccggt 
gggtcgccgg cgacccgaaa accgacggca 
gaggtccgtg ctgctgatcg tgatcccgct 
tcagcgcacg taggtgatga gtttgggctc 
cggacatggc ttcgggttgc gctgatctgg 
tggagcagtc cgccatcgac acggacaacg 
gcgagaaatg tgaccgcgtc ggcgaattcc 
gagctctctg attccctcgc gatgttctcc 
tggtccaact gggcgatgcg cgcagtggcg 
atgccgtcgc ttgccacctc accggacagt 
agggcgttcg acaacccgag gttggggatg 
atgatccggc cccatccccg ctcggtcatg 
acggagagga tcatcgcctc gaagtgccga 
cgggtgggcg gcgggccgcc ggtgatgttg 
tcgacggcgg tgacgttctc atcgatgacc 
atgttatcgc cggtcatgga cgcctcgacc 
agcgccacac gcgcgccttc ccggcttagc 
ccggctccga gtacgagggc gactcttcca 
ggtttcgggt gacggcggcc cggaaagtcg 
ggaggagtgg cagtggcccg gtcagccgcc 
gacaccgggt cggagaccgg cggtgatggt 
ctcgcggtag atcgcgttca tcgcgtcgaa 
ggtgatcacg tctgcctcgc tgcagccggc 
tgcccggcgc acctgaggtt ccagaccacc 
gcactgcccg gcgattgcca cgatgttgcc 
cgcgggagga aggtcgacgt ctgtgaacgc 
tggctggtgg tcatgtccaa ctgacttcgg 
cccgtaggag gtcctgagcc ccgtcagaag 
tcgtcgtcga tgatcgggat gaaggggagc 
tcgaaagcga cgatgtctcc gggagcgatc 
tgctggtcca tcagcttgaa cgtctccgct 
cgaaacagct cgcgtggctg tggcagcgtc 
gacaggatcg cgagatcagg ctccggccgg 
ttgtggagcc ggagcggttc ctgctcggcc 
tacttgccgg cgtcgtggga gatgtaacag 
cggacaccgt cccacgagcc gagtccggcg 
gccgtcacga tgatgcggtt ggcgtcgaac 
gaccggacgc cagcgaagaa ggtgtcgagt 
gtcacgaggc cctcgtaggc ctcgacgccg 
gccgcggcga cctcgagcgc cgagtcgagg 
ccgacctcga ggaggacgtt gagggtgacc 
atcgcccgca ccgtgtccgc atcgtcgacc 
gcgatttcgg cagtcaccca acgcagcgcg 
atgcgctgaa ccccgtagcg gcgcatcacg 
gcccacgcac cgtgagacgc ctgaagggcg 
tgcggggcga gaagcgcgtt gttgcgctcg 
ttcatagcgc gtgcgtggag tgtggtcacc 
cagttctgct ggtccacctc tccgaggagc 
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ttcactgcga ccggctcgcc gatgtcgccg 38160 
ggcacggagt tgcgcatgtg gtggtttgtc 38220 
tcgacaagtt cttgtgcctc ggcgacggtc 38280 
ttgccggctg cgatcgccgc aagtgcttga 38340 
tacacggcgt cgacgtcagg atcgctaagc 38400 
ccgaatggtg ccgctacatc ctgcgctcgg 38460 
gcggcgtcag gctgggcgga gatcgctggg 38520 
cccacgagcg accagcggaa gggacggttc 38580 
accgcgcctg tgcgacattt ccgctggcgc 38640 
aggattacac gaccatgaaa cgggtcgccg 38700 
ataccccgcc tgatctcgct tgcccacgag 38760 
ttgtcagcgc tccttcccca cccggctgca 38820 
agcccgacat cgccgtcggc tggctggatg 38880 
gtacgccggt gtttgcgctc ctgccaccgg 38940 
tagctgaagg tcgcgttcgg gcatccatac 39000 
tccggatagc tttcaatgtg gggaaaagct 39060 
ccgctcgcgc gcagcgtgca ccctggaggc 39120 
ggcgagtggg tagccctgga tgaccgtaga 39180 
gggatcgctg acgtcagtga ccccgaggag 39240 
agacaggatg tttgtcgggg ttacacgttc 39300 
gatccggtga tgtagctggc acgctcgctg 39360 
tcgcgccgcc cgtagcggcc gaccggaata 39420 
gcagtgcggt tctcccgtgc ggcctttgcc 39480 
atgcgccccg gaacgacgat gttgctggtg 39540 
gtcttggacc agcccaccag actcgagcgg 39600 
ggcgcgatga cgccggacga agtcgacgtg 39660 
cccgggagca cgcggtctgt gatcgcgacg 39720 
cgccacaggg tggcgtcctg cccggacaca 39780 
accaggatct ccaccgggcc gagctcgctc 39840 
gacaggttcg ccaagtccca gaccaggctc 39900 
tccgtgagcg cgtccttgtc gatgtcggcc 39960 
gaggcggcaa tggccgaacc gagtccgccg 40020 
ttgattgcga gatccatgac atgtccttgc 40080 
tggcacgccg ttgatggagg gacattggtg 40140 
taccacggcc tgagcgctga cctcgaacag 40200 
cgtccgggcc gggccgggct cgctgaagta 40260 
gtcgtccccg tcagcgagga acacattcac 40320 
tgaacgcagc gcggtcatga ggttctcgaa 40380 
cggtgcgagg gatctgtcgg cgaggtagcc 40440 
gatgcgtacg gcggcggagt aagggcctcc 40500 
ctgacgggac acggtgtcgg ggcctctctg 40560 
gagcttgtgg ggtcggttca accggagcct 40620 
aacgtgagga cgccgtcgac gacgttgaag 40680 
ttgtcgaagg ccgtgcacgg gtgggacatg 40740 
gcgagcccgg gggcaacgcg aacgaaggcg 40800 
tcccgcagcc gctgcctcga gccatccgcg 40860 
aggtcatggg ccgcgtcgcg cttcccgatc 40920 
gacagcacgg aagcccatgc agtgagtgcg 40980 
cgtcggccag cgagcggcga gagcttttcg 41040 
ccgcttcgaa gcactaggtc cacgtcgtcg 41100 
acgacacgat cgaagtacga gcttccgcct 41160 
aggtcgcgac gtgccagctc gagcacgatc 41220 
gcatggaggt cctccggtgt catgccgccg 41280 
accagccgga gatgctcggt cgcggcgacc 41340 
tctcgcacgc cacacctgcc accaggcact 41400 
ccgtggccgg agtcggcgat ggtatcggac 41460 
aggcagtaga agtcgaacga gctgtcgctg 41520 
ggagcatcga cgagttcgtt ggccaggatg 41580 
gccacctgag tgggcgtcgc tgccgtgaac 41640 
aagagctggg gtgacatggt ggtcttgccg 41700 
cagtacgcct ccatagtcgc caggttggcg 41760 
ggcagggcga ggtctccgcg ggccacgttc 41820 
acacgagcac ccaggggcac gcccttgagt 41880 
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gcgtcgtcga ggacggtctc gcggaggtcg 
agctgctcga ccgcgtgagt gctcatcgtg 
gaccgatgcc gtcgatccgg cactccacac 
gggttccggt ggagatgatg tctcccggaa 
cgaggctggc gggcttgtgg gtcatgtggg 
tcttgaccgt cgagatctcg atgtcgtcca 
gcggtacgag ctccggtccg aagcagaaga 
agcgcggatt gatccgcaag atgtcctcgg 
acacgtagtc catcgcctcg gactcgctga 
cgatctcagc ctcggacgtg gtgcgctcac 
ggccgatgat cgtgtggtcg cacttcacga 
tcaggtccgc tgcgtgatcg ccgtagttca 
ggtacggcgc ggtgaaccgc acctcgtcca 
tgacacgctc acgcagccgc gcgaggtggt 
ccgccagctc ggggtctatc tcccggattg 
ggatctctcc ttccgcagcg gtgatggtcg 
cggtcgaatg gtgaggtagg gggtggggga 
cccgcgctgc gtgtgtgagg agaggacaag 
gttccgggct gcccggtcgg cgtcgccagc 
cgaggcgcag gagatggagt caccggtcga 
cagcaccagg gcgttgtcga ccgcgcgctg 
gatgtggtgg aagacgcgat cctcggcgcg 
gcgcagggat gaggtggcga cctcgtggag 
tgcggtgaac caggctgccg agctctccag 
ggcgcggacc gaggcgctga tgacgcgcag 
gtgccgcaag cggagaagcg cctcgcgcac 
gagcttggcc tcgctcacac gacgcccagg 
gatggcctcg aagacccgat ctgtcagatt 
accgttgccg gaccccaccg cggagtctgc 
gttcagcgtg cccgcagtgc gtggctgatt 
ggaatacggt atatggtcct caccgcgtaa 
atgaagtcct tgggatatgt cgtataccga 
ggccttcgag agcctggatc gactggccac 
cggcatcgtg agcccgttct acacgctggc 
tgcgatcgcg cgctctctgc tgcgtcgaca 
ggtcgaggaa ggccggttcc cgcaggggga 
cgcgcgcacg ctcgccggtc ttggtagcga 
cggcccggtg gagaagctgg tcgacgctgc 
ccgattctcc agcgcggccg aggcccggga 
catcgagaag ctggggcaga actctgttgg 
caccgccggt gatctgtttg ccgacgacta 
gacgctggcc gtgggagaca acggcaacga 
cgaggcgctg actccgcgag gcgtcagcgt 
ggatcatctg ctgcctgcat ccgtgtcgaa 
ggccatcctg gctcgacgcg caaacctggc 
cgagctgggg ctgcgcgcag gtctacgcag 
cgacgacggg atcccactgc gtttcgtcgc 
tcaccagagc ctgctcggcg acccctgggt 
caaccctcgg ttgccagcct tcgtcatcat 
cgttgatctg cagcacgcgg agcccatcga 
ggcgaggccc acgtccggga tctgtcatgg 
ccaggatgac gcgttcgact tcatgcggtt 
cctcgaggtc accgacgcgg gagatccgga 
ccgcaccgac ctgcccaaat acagcgtgta 
tgacgtcgtt gacctgtgga ccgacgacat 
cttcgagcac ttgctgctcg atgccggact 
cggcccggtt tacctcagca accgtgagtg 
agtggtttca atgcgtccca tgcccccgga 
tgcgcatccc gaggtgcacg ggggccccgt 
cgacaacgtt gatgaacctg actggggtga 
ggtgttctgg gcctgcggcg tgaccccgca 
aatgatcacg cactcgacgg gtcacatgtt 
aggcgggggc cccggggtgg tgcc 
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tcggcacatt gcaggagtcg ggtccggtcg 41940 
tagtggcctt tcgcgccgtc gtggtcagct 42000 
ggtctccgtc gccgatgacg agcgcgccag 42060 
acagcggcat catcttcgag tggaagctca 42120 
cgaccgtgtt ggtcctgaac tcatcgccgt 42180 
agctttcgaa cttctctatc acctcatcga 42240 
acgtccggaa gttcttcgag cgggtgagat 42300 
cggtctggtc caggatggtc gtgacccccc 42360 
cgttctcgca agtcctgccg atgatgagcc 42420 
tctgcgccgg aatcacgatc ggctcatccg 42480 
aggacgccgg ctggtcgggc gccgactcgc 42540 
gaccgatgcc ccagatcttc cggggggatc 42600 
cgcgtgtgaa ggcgaggtcg ggggcgttct 42660 
ctcctgcgat cacgtccatg atggtcgtga 42720 
ctgcgacgcc gcggtcgctc acgacaacgg 42780 
ccaggtgctc ggtcacgaaa gcccgtcctt 42840 
gttggtttaa ctcgttcccg ggggcacgtt 42900 
cgacatcacg tggaggatgt ggtcggcgtt 42960 
cagaacggcg gatgccgtcg cgatgtgctc 43020 
gggagtgtcg cgttctcgga gtacagcggt 43080 
gaggatctcg ttcccgcaga actcggcgat 43140 
gaactgggcg gagtcgccgg actctgccgc 43200 
cctctcccgt tgggcgctgg tagcccgatg 43260 
ccccgcccgg tgctcgtagg cgtcgcgaat 43320 
ccccttcggg gcagcctcga ccaggcccac 43380 
gggggtcttg ctgacattca gctcggccgc 43440 
tgggagtgcc tgacttacga tgctgtcgcg 43500 
gtcgacgcgt gcgtcgaggg gtgcgagcat 43560 
ggcggctggg cgatggtgca gctccaccat 43620 
cggttccatc gtcccccttc cttggacagt 43680 
gcaaccgccc gaggcgggac tccgtagcgg 43740 
gggagagctg atgaagcccg agtcgcaccc 43800 
gatcgagatg cgtccgccga cgagcccggc 43860 
gcggggtgag cggtcggagc cgatcagcct 43920 
gggtgaaaag gcgttgatcg tcacgggtct 43980 
ggtggacggc cccatcggct ctctggcact 44040 
ggtgaccatc gtcatcgacc cagaggccgt 44100 
gggcctggat ggcgtgacct tgatggagtc 44160 
gttcgcggcg cagttcggcg tcgtcgccgc 44220 
cggccgtcat ctgatctggg gcacgccggt 44280 
ccttcgcgcc gccacggaca acggtgctct 44340 
gatcgggttc ggcaacattg ccgccgagtc 44400 
tgagggcgga ttcttcgcgt ccacgacggt 44460 
tttgggctgc tacgtcatca ccggtgcggt 44520 
agtcaccggg gatctggtgc gcgagtggac 44580 
cggcggcgtt gacgatcccg cgttccaagg 44640 
cgcgcatgcc gagctgatct ccggaatcgt 44700 
ggcagaccag tgacggccac ccgtagcggg 44760 
cgccgggaag gggaagtcat cgatggagtt 44820 
ggcatggcgt gccaaccgcg cgaatgcgtg 44880 
attcgtgcag gccaacctcg tgatgctgcc 44940 
cgcccagcgg aaccccaagc cgtgcccgat 45000 
gccgaagctg accgcccctg gtgctgacct 45060 
ccgggaaggg gcgctggtgg aggagcgcag 45120 
ggtcgcgttc ctgcttggct gcagcttctc 45180 
cccgatccgg cacatcgacc aggggatcaa 45240 
tgtgcccgca ggccagttcc gggggaagct 45300 
ccgggtggcg gaggctgtca ccatcacgca 45360 
gcacatcggc tcccctgaga agttgggcat 45420 
acgagtgcgt atggagccgg gcgacgtgcc 45480 
gcaggcggcg atcgaggcga agaccgagct 45540 
cgtgaccgga atgcggatcg actccgagtg 45600 

45624 

Page 46 



LeA33955-US 



<210> 5 
<211> 50000 
<212> DNA 

<213> Saccharopolyspora spinosa 
<400> 5 

gggcgcgccg caattcgatg acgttcatgc 
cagcagaggc tgaacttact ggtggtgtgt 
cgaagccggg aacctgatag ccgtcatcgg 
cccggcttcc ttctggcggt tgctgcgcac 
agggcggtgg ggcgacccgt tgcctggtcg 
tttcctggct gatgtcgact gcttcgatcc 
ggcagccgtg gatccccagc agaggctggc 
cgccggtatc cccgccggcg agctgcgcgg 
ctctgacgac tacgccgccc tgctgcgcga 
cctcaccggc acccatcgaa gcctgatcgc 
cgggccaagc ctgacggtgg attcaggtca 
cagcgagagc ctgcgacggg gtgagtgcac 
cctggccgcc gagagcaaca gcgctctgat 
ctgcttcacc ttcgatgtgc gggcgaacgg 
cgtgctgaag aaggccgatc aggcgcacgc 
cggcagcgcg gtcaacaacg atgggggcgg 
ccaggcggag ctgctgcgcc aggcataccg 
gtatgtcgag ctccacggca gcgcgaccag 
cggagctgtc ctgggggcgg cgagacggcc 
gaccaacgtc ggccatctgg aagcagcggc 
cagcatctgg caccgcgaac tgccgccgag 
cccgctggac gaattgaacc tacgcgtcca 
ggggccgctg ctggccggcg tcagcgcctt 
gctctccggc acgtcccggg tggagcgacg 
gtgggtcttg tcggccagaa caccggtcgc 
gcacctcaat acggccggtc aaagtccgtt 
atccgcgctg ccgcaccggg ccgcgctggt 
gttgaaggcc ctcgctgacg gcgacgacgc 
cgagcgggca gcggtcttcg tctttcccgg 
gcagctgctc gaaacctccg aggttttcgc 
ggcgccgcac ctggattggt ccctgctgga 
ccttgaccac gacgatgtcg tccagcccgc 
gctctggcgt tcgtggggcg tgcgtccggt 
cgcggcggcc tgcgtcgccg gggccctgtc 
gcgcagcagg cttctgacgg cgctggccgg 
cgccgaagag gtgcggcaaa tcctgttgcc 
gaacggaccg tcgtcgaccc tggtgtcagg 
cgagtgcgca gaccgagagc tccggatgcg 
gcctcacatc gaggttgtcc gggatgagct 
gacgggaagc atcccgatct attcgacgac 
cgccgactac tggtaccgca accttcgtca 
cctgttgaag cgggggtacg acgcattcat 
gaacatccag gaaaccgccg tgcgagcagg 
ccgcggcgaa ggtggcatgc ggcaggcgct 
agtggccgcg gactggcacg cggtcttcgc 
gacgtacgcc tttcagcgac agcgctactg 
gcccgagagc gacgtgtcga cggcgttgcg 
cgtggactcg acgaccctca cgatgatccg 
cgatccgaaa gaggtggacc cggatcggac 
ggtggtcgag ctgtgcgacc gcctaaacgc 
cgttttcgac tgtcctacgc cggacaagct 
cgagccggct cccatgacgt cacaccggcc 
gatcgggatg ggctgtcggt ttccgggtgg 
ggtcgccgct gggcgggacg tcgtgtccga 
gcgtgcgggg acatcgcacg tgcgcgccgg 



gccgtgtcgg ggaatcgccg gtggcggcgc 60 
ccaggaatcg gaggggcagt accgaatgag 120 
actgtcctgc cgcctacccc aggcgcctga 180 
cggaacggac gccatcacca cggtcccgga 240 
ggatgcgccc aagggcccgg aatggggtgg 300 
cgagttcttc gggatctcgc cgcgagaagc 360 
tctggagctc gcctgggagg cactcgaaga 420 
tactgccgcc ggtgtgttca tgggggcgat 480 
gagcccgccg gaagtggctg cgcagtaccg 540 
caaccgcgtg tcctatgtgc tcggcctgcg 600 
gtcctcgtcc ctggtcggcg tgcatctcgc 660 
gatcgcactc gccggcggcg tgaacctcaa 720 
ggacttcggc gcgctctccc cggacggtcg 780 
ttacgtccgt ggtgagggcg gcggccttgt 840 
cgatggcgac cggatctact gcctcatccg 900 
tgccgggctc accgttccgg cggcggacgc 960 
gaacgcgggc gtcgacccgg ccgccgtgca 1020 
ggtcggggat cccgtcgaag cagcagccct 1080 
cggcgacgag ctgcgtgtgg ggtcggcgaa 1140 
gggcgtcacc gggttgctga agaccgcact 1200 
tcttcatttc accgccccca acccggaaat 1260 
gcgtgatctg cggccgtggc cggagagcga 1320 
cggaatggga ggcacgaact gccacctggt 1380 
gcgcagtgga cccgctgagg cgaccatgcc 1440 
attgcgtgcg caggcggcgc gcttgcacac 1500 
ggacgtcgcc tactcactgg cgaccactcg 1560 
cgcggacgac gaaccgaaac tgctcgccgg 1620 
gcccacgctg tgccacggcg cgacttccgg 1680 
acagggcagc cagtggatcg ggatgggtag 1740 
ggcgtcgatg tcggactgcg ccgacgcatt 1800 
tgtgctgcgc aacgcggccg gcgctgcgca 1860 
gctgttcgcc atcatggtct cgctcgcgga 1920 
ggcggtcgtc gggcactcgc agggggagat 1980 
cgtccgcgat gccgccaggg tggtggcggt 2040 
cagtggcgcg atggcctcgt tgcagcatcc 2100 
ctggcgcgat cggatcggcg tggcgggggt 2160 
ggaccgggag gcgatggcgg aactgctggc 2220 
ccggattccc gttgaatacg cctcccattc 2280 
gctggggctg ttggcgccgg tcgaacccag 2340 
gaccggggac ctgctggacc ggccgatgga 2400 
accggtgctg ttcgaagcgg ccgtcgaggc 2460 
cgagatcagc ccacacccgg tgctgactgc 2520 
gcgggaggta gtggcgctcg ggacactccg 2580 
gacgtcgctg gccagagcac acgtacacgg 2640 
cggtaccggg gcgcagcggg tcgacctgcc 2700 
gctggacgcg aagcttcccg acgtcgccat 2760 
ggaaaagctg cggtcttcgc cgagggcgga 2820 
ggcacaggca gccgtggtcc tcggccactc 2880 
gttcaaggac ctgggcttcg attcctcgat 2940 
cgccacaggt ctgcgactcg caccgagcgt 3000 
cgcccgccag gtacggacgt tgttgttggg 3060 
ggactccgat gcggacgagc ctatcgccgt 3120 
ggtgtcctcg cccgaggagt tgtggcagtt 3180 
gttcccggct gaccgaggtt gggacctgga 3240 
cgggttcttg catggcgccc cggattttga 3300 
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ccccgggttc ttccggattt cgccgcgcga 
gctgctggaa atcgcctggg aagcagtcga 
cggaagtcaa accggggtct tcgtcggcgc 
cgaagcgtcc gaggaggcgg ccgggtacgt 
gggtcgggtt gcgtattcgt tcgggttcga 
ttcgtcgtcg ttggtggccc tgcatttggc 
tctggcgttg gccggtggtg tgaccgtgat 
gcggcagcgt ggtttggcgc cggatgggcg 
caccggctgg tccgagggtg ctggcctggt 
gaatgggcat gaggtgctgg cggttgttcg 
gaatggtttg accgcgccga atggttcgtc 
gagtgcgggg ttgtcggtgt ccgatgtgga 
gcttggtgat ccgatcgagg cgcaggcgct 
ggaacggcca ttgtggttgg gctcgatgaa 
ggggatagcc ggcgtcatga agatggtgat 
gttgcacgtg gatgagccga cttctggggt 
tacggagaac acgccctggc ccgggagtgg 
cgggatcagt ggtactaacg cgcacgtcat 
gtctgcgggg ccgggttcgg gctctgtcgt 
gggcaaaaca cccgaagcgc tatccgcgca 
gcgacctgat gtctcctcgc tggatgttgg 
ggatgagcga gcggtggtgc tggggtcgga 
gctgtctgcc ggtcatgagg cttctgggtt 
catcgggttt gtgttttccg gtcagggtgg 
ccgggctttt ccggtgttcg ctgctgcctt 
tctgggccag gaaatcgggg ttcgggaggt 
tcggacgttg tgggcgcagt cgggtttgtt 
ggattcgtgg ggggttcggc cgagtgtggt 
ggcgttcgcg gcgggtgtgg tgtcgttgtc 
ccggttgatg caggcgttgc cgtctggcgg 
gctgttgtgg tcgttgttgg ccgatcaggg 
tgcggggtcg gtggtgctct ctggtgatcg 
ggacgggcaa gggatccggt cgaggtggtt 
gatggatccg atgctggcgg agttcgccga 
tgaagtgccg atcgtgtcga ccttgaccgg 
gcccgactac tgggtgcgtc aggtgcgaga 
gctggtcgag cacgatgtgg ccaccgttgt 
gctgatccag gaatgtgtcg ccgcatccga 
gatgcgcagg aaccaggacg aggcgcagaa 
acgtggtggt gcggtggact ggcggtcgtt 
gctgcccacc tacgccttcc aacgacagcg 
cgccggcgac atgggcaggc gtgtcgaagc 
tgtggaatcg cttgcacgcg tattgggcat 
gagaagcgcc cttccggtgt tggccggttg 
ggatcagcgg tgttaccgaa ttggctggcg 
tgttttcggt acctggctgg tcttcgcgcc 
ggactgcgtt acggcactgc gggcacgtgg 
cgacccgacc tccttcggcg accgggtacg 
tggcgtgttg tcaatgttgt gcttggaaga 
acggggtttt gcgttgaccg tggagttggt 
ccggttgtgg ttgctgacgt gtggtggcgt 
gcaggccctg gcgtgggggt tggggcgtgt 
cggcttgatc gatattccgg tcttgttcga 
gctggcaggt ctcgatgagg acgaggtcgc 
gttggtacgc cacactgtct cagctgatgt 
gctggtgacg ggcggcacgg gtggtttggg 
cggagccgaa catgtggcga tggtgagtcg 
gttgcggacg gaactggagg atctgggtac 
cgatcgcgag gcgctcgccg aagtgctgaa 
ggtagtgcat gcggcaggcg tgatcgagac 
tttcgatcac gtgttgtccg caaaggtggc 
cgatgtggaa ttggacgcgt tcgtcttgtt 
gggacacggg gcttacgcag cggcgaatgc 
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ggcgttggcg atggatccac agcagcggtt 3360 
acgaggcggg atcaacccgc agcacctgca 3420 
gacctccctg gactacgggc cacgcctgca 3480 
gctcaccggc agcaccacga gtgtggcgtc 3540 
gggccctgcg gtgacggtgg atacggcgtg 3600 
gtgtcagtcg ttgcgttcgg gtgagtgtga 3660 
ggccacgccg gggatgttcg tggagttttc 3720 
gtgcaagtcg ttcgcggagg ccgccgacgg 3780 
tctactggag cggttgtcgg atgcccggcg 3840 
tggtagtgcg gtgaatcagg acggtgcgtc 3900 
gcagcagcgg gtgattgccc aggcattggc 3960 
tgctgtggag gcgcatggga cgggcacgcg 4020 
gatcgccacc tacggccagg gccggcttcc 4080 
gtcgaacatc ggtcacgcgc aggcagctgc 4140 
ggcgatgcgg cacgggcagc taccgcgcac 4200 
ggattggtcg gcggggacgg ttcaactcct 4260 
tcgtgttcgt cgggtggggg tgtcgtcgtt 4320 
cctcgaacag cccccgggag tgccgagtca 4380 
ggatgttccg gtggtgccgt ggatggtgtc 4440 
ggcaacggcg ttgatgacct atctggacga 4500 
gtactcgctg gcgttgacac ggtcggcgct 4560 
ccgtgaaacg ttgttgtgcg gtgtgaaagc 4620 
ggtgaccgga tctgtggggg ctgggggccg 4680 
tcagtggctg gggatgggcc gggggcttta 4740 
tgacgaagct tgtgccgagc tggatgcgca 4800 
ggtgtccggt tcggatgcgc agttgctgga 4860 
cgcgttgcag gtgggcttgc tgaagttgct 4920 
gttggggcat tcggtgggcg agttggcggc 4980 
gggtgcggct cggttggtgg cgggtcgtgc 5040 
tgggatgctg gcggtgcctg ctggtgagga 5100 
tgatcgtgtg gggatcgccg cggtcaacgc 5160 
ggatgtgctc gatgaccttg ccggtcggct 5220 
gcgggtgtcg catgcgtttc attcgtatcg 5280 
attggcacga accgtggatt accggcgttg 5340 
agacctcgat gacgctggca ggatgagcgg 5400 
gccggtccgc ttcgccgacg gtgtccaggc 5460 
cgagctcggt ccggacgggg cgttgtcggc 5520 
tcacgccggg cggctgagcg cggtcccggc 5580 
ggtgatgacg gccctggcac acgtccacgt 5640 
cttcgccggt acaagggcga agcaaatcga 5700 
gtactggctg aacgcgctgc gtgaatcttc 5760 
gaagttctgg ggcgccgtcg agcacgaaga 5820 
tgtggacgac ggcgctgctg tggattccct 5880 
gcagcgaacc cgcaccaccg agtccattat 5940 
gcaggtagcc ggactcccgc cgatgggaac 6000 
tcatggctgg tccagcgaac cggaggtggt 6060 
tgcctcggtg gtgttggtgg aagctgatcc 6120 
aaccctgtgt tcgggccttc cggatcttgt 6180 
atcggtcctt ccgggatttt ctgcggtgtc 6240 
gcgggttttg cgggcagctg gtgcgactgc 6300 
gtcggtggga gatgtaccgg ttcgtccagc 6360 
tgtggggttg gagcatccgg actggtgggg 6420 
cgaagacgct caagagcggt tgtcgattgt 6480 
gatccgtcct gacggcatgt tcgcgcgtcg 6540 
gaagaaggcg tggcgcccca ggggatcggt 6600 
ggcgcacgtt gctcgctggc tggccgacgc 6660 
acgcggcgag caggcaccga gtgctgagaa 6720 
ccgggtgtcg atcgtgtcat gcgatgtgac 6780 
agcccttccg gctgaaaacc cgttgaccgc 6840 
tggtgatgcg gcggcaatga gcctggctga 6900 
cggtgccgcg aatctggatg ccttgttggc 6960 
ctcatcggtg tcaggagttt ggggcgctgg 7020 
ctatctggat gcgctcgcgg aacagcgtcg 7080 
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gtcgcgaggg ctggtcgcga ctgcggtggc 
ctccggagaa acaggagacc agctgcgccg 
cgccatcgcc ggaatccggc aggccgtgga 
tgtcgattgg gcacgtttca gcgcgggatt 
cgaactggcc gaggtcaagg aactcctcgt 
cgacgcgtcg ttggaatggc ggcagcgatt 
gctgatcctg gagctggtac gcggcgaaac 
ggccgttgca tcggaacgag ccttcaagga 
actccgcgtt cggctcaatc gagctaccgg 
ccatcccacg cctgcggaac tggctgcgga 
aggagcaggc attcccgagg aggacgaggc 
gttcccggcc ttgcgcgagg caggcttggt 
ggtcgactcc ggtatctcct cggacgatgc 
agccggcctc gtcgaagcag cgctgggcga 
atgacggtga ccaccagtta cgaagaagtt 
aacgaacgcc tccggcgcgg cagggatcgg 
atcgtggcga tgagttgtcg ttatcccggt 
ctggctgccg gcggtgtgga cgcgatctcc 
gatggcgtgt tcgttccgga ctccgatcgt 
tttcttcagg gcgtgtcgga gttcgacgcg 
ctggcgatgg atccgcagca gcggttgctg 
gctgggctgg agcagcggtc gacacgcggt 
ggccaggact acgcgtcgtg gttgcggacg 
acgggcggtg cggcagcggt tctttcgggc 
cctgcggtga cggtggatac ggcgtgttcg 
caagcactgc gggccggtga gtgcgacctt 
acgccgaagg tgttcctgga gttctcccgc 
aagtcgttcg cggcgggtgc ggatggcact 
ctggagcggt tgtcggatgc ccggcggaat 
agtgcggtga atcaggacgg tgcgtcgaat 
cagcgggtga ttacccaggc gttggcgagt 
gtggaggcgc atgggacggg cacgcggctt 
gccacctacg gccgtgatcg tgatcctggc 
aacatcggtc atacgcaagc ggcggcgggt 
atgcggcacg ggcagctgcc acgcacgttg 
tggtcggcgg ggacggttca actccttacg 
gttcgtcggg tgggggtgtc gtcgttcggg 
gaacagcccc cgggagtgcc gagtcagtct 
gttccggtgg tgccgtggat ggtgtcgggc 
acggcgttga tgacctatct ggacgagcga 
tcgctggcgt tgacacggtc ggcgctggat 
gaaacgttgt tgtgcggtgt gaaagcgctg 
accggatctg tgggggctgg gggccgcatc 
tggctgggga tgggccgggg gctttaccgg 
gaagcttgtg ccgagctgga tgcacatctg 
tccggttcgg atgcgcagtt gctggatcgg 
ttgcaggtgg gcttgctgaa gttgctggat 
gggcattcgg tgggcgagtt ggcggcggcg 
gcggctcggt tggtggcggg tcgtgcccgg 
atgctggcgg tgcctgctgg tgaggagctg 
cgtgtgggga tcgccgcggt caacgctgcg 
gtgctcgatg accttgccgg tcggctggac 
gtgtcgcatg cgtttcattc gtatcggatg 
gcacgaaccg tggattaccg gcgttgtgaa 
ctcgatgacg ctggcaggat gagcgggccc 
gtccgcttcg ccgacggtgt ccaggcgctg 
ctcggtccgg acggggcgtt gtcggcgctg 
Qccgggcggc tgagcgcggt cccggcgatg 
atgacggccc tggcacacgt ccacgtacgt 
gccggtacgg gagcgaaaca aatcgagctg 
tggctggtgc catcggattc cggtgatgtg 
ccgctgttgg gtgctgtggt gccggtcgcg 
atttcggtgc ggacgcatcc gtggctggcc 
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ctgggggccg tgggccggcg agggcatggc 7140 
atacggcctt tccccaatgg ctccgcagca 7200 
acaggacgaa atttccctgg tagtggccga 7260 
gctggcggct aggccgcggc cgctgctgaa 7320 
cgatgcccag cccgaggcgg gagtccttgc 7380 
gtccgcggca ccgaggccga cacaggaaca 7440 
cgctctggtg ctgggacacc ccggggcagc 7500 
cagcggattc gactcgcagg ccgcggtcga 7560 
cctccagttg ccatcgacaa ttatcttcag 7620 
gctgcgggcg aggcttcttc ccgagtccgc 7680 
gcgaatcaga gcggcactga cgtcgatccc 7740 
gagtccgctg ctcgcacttg ccggacaccc 7800 
ggccgcgacc tcgatcgatg cgatggatgt 7860 
acgcgagtcc tgagaccgcc gacctgggag 7920 
gtcgaggcac tgcgagcatc gctcaaggag 7980 
ttctccgcgg agaaggacga tcccatcgcg 8040 
caggtctcct cgccggagga cctgtggcaa 8100 
gaagttccgg gggatcgcgg atgggacctg 8160 
cctggcacgt cgtatgcctg cgcgggcggt 8220 
ggtttcttcg ggatttcgcc gcgtgaggcg 8280 
ctggaagtcg cgtgggaggt cttcgagcgg 8340 
tcccgcgttg gcgtgttcgt cggcaccaat 8400 
ccgccgcctg cggtggcagg tcatgtgctg 8460 
cgggttgcgt attcgttcgg gttcgagggt 8520 
tcgtcgttgg tggcgttgca cctggcgggg 8580 
gcccttgccg gtggcgtcac ggtgatgtcg 8640 
caacggggtc tcgcgccgga tgggcggtgc 8700 
ggatggggtg agggtgccgg actgttgttg 8760 
gggcatgagg tgctggcggt tgttcgtggt 8820 
ggtttgaccg cgccgaatgg ttcgtcgcag 8880 
gcggggttgt cggtgtccga tgtggatgct 8940 
ggtgatccga tcgaggcgca ggcgctgatc 9000 
cggccgttgt ggttggggtc ggtcaagtcg 9060 
gtggctggtg tgatcaagat ggtgatggcg 9120 
cacgtggaat cgccgtcgcc ggaggtggat 9180 
gagaacacgc cctggcccag gagtggtcgt 9240 
atcagtggta ctaacgcgca cgtcatcctc 9300 
gcggggccgg gttcgggttc tgtcgtggat 9360 
aaaacacccg aagcgctatc cgcgcaggca 9420 
cctgatgtct cctcgctgga tgttgggtac 9480 
gagcgagcgg tggtgctggg gtcggaccgt 9540 
tctgccggtc atgaggcttc tgggttggtg 9600 
gggtttgtgt tttccggtca gggtggtcag 9660 
gcttttccgg tgttcgctgc tgcctttgac 9720 
ggccaggaaa tcggggttcg ggaggtggtg 9780 
acgttgtggg cgcagtcggg tttgttcgcg 9840 
tcgtgggggg ttcggccgag tgtggtgttg 9900 
ttcgcggcgg gtgtggtgtc gttgtcgggt 9960 
ttgatgcagg cgttgccgtc tggcggtggg 10020 
ttgtggtcgt tgttggccga tcagggtgat 10080 
QQQtcggtgg tgctctctgg tgatcgggat 10140 
gggcaaggga tccggtcgag gtggttgcgg 10200 
gatccgatgc tggcggagtt cgccgaattg 10260 
gtgccgatcg tgtcgacctt gaccggagac 10320 
gactactggg tgcgtcaggt gcgagagccg 10380 
gtcgagcacg atgtggccac tgttgtcgag 10440 
atccaggaat gtgtcgccgc atccgatcac 10500 
cgcaggaacc aggacgaggc gcagaaggtg 10560 
ggtggtgcgg tggactggcg gtcgttcttc 10620 
cccacctacg ccttccaacg acagcggtac 10680 
acaggtgccg gtctggccgg ggcggagcat 10740 
ggtggtgacg aggtgttgct gaccggcagg 10800 
gaacaccggg tgctgggtga agtgatcgtt 10860 
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gcgggcaccg cgttgctgga gatcgccttg 
gtggaagagc tcaccctgga agcaccgctg 
cagctgcgag tgggcgcgcc cgagaattcc 
cccgaagggg cggcggagca tgactggacg 
cgcggcgagg cggctggaga cctggccgac 
ctcgacgaat tctatcggga cctcgcagag 
gggctcaagg cggcctggcg gcaaggggac 
acggaagatt ctggtttcgg ggtgcatccg 
gctgtccgag acatggatga cgcacgcttg 
gccaaggccg cgccggcttt gcgggtccgc 
ctgctggttt gtgatggcac cggtcgaccg 
tcggctgcgg cccggcggac cggtgcgcgc 
ttgagctggc caacggttca actgccgaca 
ggcacctcag aagtgtccgc tgacatacag 
gcgttggatg ccggtgccga accacccggc 
ggacgaacag cggatgtccg ggagacgact 
ctttccgatc agcgactcaa cgaatcccga 
gtggagccgg gcgaacccgt gaccgatctg 
tcgacgcaga ccgaacaccc tgatcgcttc 
ctcctccccg cgctgccggg ggtgctggcc 
ggcggcgctc atgcgcccag actggctgga 
gacggcaccg ggtggcgatt ggaggccacg 
gtggacgaac cgacggccac ggcaccgctg 
gcggccgggg tgaacttccg ggatgcgctc 
tcgctgggca gtgagggcgc cggggtcgtg 
gcacccggcg accgcgtgat gggaatgatc 
gaccatcgca tggtgacgag gattcccgct 
ccgatcgtct ttctcaccgc ctactacgcg 
gagtcgttgc tggttcattc ggccgccggt 
aggcacctcg gtgcagaggt gtacgccacc 
ctgagccgag aacacctcgc ttcgtcgcgg 
gcaaccggcg gacgcggcgt cgacgtcgtg 
gcgtctctgc gaatgctgcc gcgcggtggc 
cgtgaccccg tcgaggtcgc cgatgcgcat 
gtagaggcag gcccgcagcg aatcggcgag 
ggacgcgtgc tggagcccct gcctgtcacg 
ctacggcacc tgagccaagc gcggcatgtg 
tgggacgccg caggcacggt tctggttacc 
gcccggcacc tcgtgatcga gcgcggggtg 
cccgcagcca gtggcgctgc tgagctcgtg 
tccttgcagg cttgcgatgt cgccgatcgt 
ccggacgagc atccgttgac cgccgtggtg 
tccgaatcgc tcaccgtgga gcggctggac 
cggaatctgc tcgagctgat cgacccggac 
ggtgtgctcg gcagcggtgg gcagggtaac 
ttggcgcagc aaaggcagtc gcgcggccta 
gcggaacatg gcatggccag caccttgcgc 
gggttgctgc cgatctcgac cgaggagggg 
gcgcataccg tggtggcgcc ggttcgattc 
ttctccgtcc tgcaaggttt ggtcgggccg 
gatgccgaga gcctccggaa acggttggga 
ctgctggacc tcgtccgcat gcatgtggcg 
atcaccgcgg acggcacgtt caaggtgctg 
cgcaaccgga tcaacggggc gacggggctg 
ccgacgccgg atgcgctcgc cgcgcacctc 
gggacattcg aggaactcga caggtgggcg 
gccacgcggg cgcagatcac cacccggcta 
tccggcggaa ccggcggcgg ctccgtgccg 
cttttccaac tcctcgacaa cgatctcgaa 
aacttcctgg agggaaacgc cacatgtcga 
gtgcgctcgt ggatctgcac caggcgcgcg 
gggaacccat cgcgatcgtg gcgatgggct 
aagggctgtg gaaactggtc gcctccggtg 
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cacgcggggg aacgtcttgg ttgtgaacgg 10920 
gtcctgccgg agcgcggggc gatccaggtt 10980 
ggacgcaggc cgatggcgct gtattcacgc 11040 
cggcacgcca cgggccggtt ggcgccaggc 11100 
tggccggctc ctggcgcgct gccggtcgac 11160 
cttgggctgg agtacggccc gatcttccaa 11220 
gaggtgtacg ccgaagccgc gctgccggga 11280 
gcactgctgg acgcggctct gcacgcaacg 11340 
ccgttccagt gggaaggtgt gtccctgcac 11400 
gtggtcccgg ctggtgacga tgccaagtcc 11460 
gtgatctcgg tggaccgact cgtattgcgg 11520 
cgacaggccc atcaagctcg gttgtaccgg 11580 
tccgctcagc caccgtcctg cgtgcttctc 11640 
gtgtatccgg acctccggtc gttgacggct 11700 
gtcgtcatcg cacccacgcc ccccggcggt 11760 
cggcatgcac tcgacctggt acaaggctgg 11820 
ttgctcctgg tgacacaggg agcagtggcc 11880 
gcgcaggccg cgctctgggg actgctgcgg 11940 
gtcctcgtcg atgtgcctga gcccgcgcaa 12000 
tgcggcgaac ctcagctcgc gttgcgacgt 12060 
ctgggcagcg atgacgtcct gcccgtgccg 12120 
cgcccgggaa gcctggatgg gttggcattg 12180 
ggtgacggtg aggtcaggat tgcgatgcgc 12240 
atcgcgctcg gtatgtatcc cggtgtggca 12300 
gtggagaccg gccccggcgt caccggcctg 12360 
ccgaaggcgt tcgggccgct cgcggtcgcc 12420 
ggttggagct tcgcgcgggc cgcatcggtg 12480 
ctggttgatc tcgccgggtt gagaccaggg 12540 
ggggtgggga tggccgcgat ccaactcgcc 12600 
gctagcgagg acaagtggca agccgtggag 12660 
acgtgcgatt tcgagcagca gttcctcggg 12720 
ctcaactccc tcgccgggga gttcgccgat 12780 
cgtttcctgg agttggggaa gacggatgtt 12840 
ccgggcgtgt cttaccaggc tttcgatacc 12900 
atgcttcacg agctggtgga gttgttcgag 12960 
gcttgggacg ttcggcaggc gcccgaggcg 13020 
ggaaagctgg tgctcaccat gcctccggtg 13080 
ggcggaacgg gagcacttgg cgcagaggtc 13140 
cgaaacctgg tcctcgtcag caggcgcggt 13200 
gcgcaactga cggcctacgg tgccgaggtt 13260 
gagaccttgg cgaaggtgct tgccagcatc 13320 
cacgcggctg gtgttctcga cgacggagtg 13380 
caggttctgc gcccgaaggt cgatggcgcg 13440 
gtggccctcg tgttgttctc gtcggtgtcg 13500 
tacgcggcgg ccaactcctt cctcgacgca 13560 
ccgacgagat cattggcctg ggggccctgg 13620 
gaagccgagc aggatcgatt ggcgcgatct 13680 
ttgtcccagt tcgacgccgc gtgcggcggc 13740 
agccgcttgt ccgacgggaa cgcgatcaag 13800 
catcgcgtca acaaagcggc gactgcggat 13860 
cgcttgccgg atgcagaaca acatcggatt 13920 
gcagtgctcg gattcgccgg ttctcaggag 13980 
ggcttcgact cgttgaccgt ggtcgagttg 14040 
cgactgcccg ccaccctggt gttcaactac 14100 
gtcaccgcgc tgtccgcaga ccgcctggcc 14160 
gcgaacctgc ccacgctggc cagggatgag 14220 
caggcgatct tgcagagcct ggcggacgtg 14280 
gaccggctca gatcggccac ggacgacgag 14340 
cttccctgat gcctcagccg gagccttcgc 14400 
atgaagagaa gctccgggag tacttgcggc 14460 
agcggctgca cgaggcggag tcgggagagc 14520 
gccggtaccc gggtggggtg caggacccgg 14580 
gcgacgccat cggtgaattc cccgctgatc 14640 
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gtggttggca cctcgacgag ctctacgatc 
cccggcacgg cggcttcctc cacgacgccg 
gcccccgtga ggcgctcgcg atggacccgc 
agaccgtcga atccgctggg atggacccga 
tcgcgggatt gatgtacgag ggctatgaca 
aaggctatct cggaaccggc aatgcgggaa 
tcgggttcga gggcccagcg gtgacggtag 
tgcatttggc gtgtcagtcg ttgcggcagg 
tgacggtgat gtcgacgccg gagaggttcg 
cggatgggcg gtgtaagtcg ttcgcggcgg 
ccggtttggt gttgctggag cggctgtcag 
cggttgttcg tggtagcgcg gtgaatcagg 
acgggctggc ccaggagcgg gtcattcagc 
ccgatgtgga cgctgtggag gcgcatggaa 
cgcaggctct gatagccgcc tatggacagg 
ggtcggtcaa gtccaacatc ggtcatacgc 
agatggtcat ggcgatgcgg cacggggagc 
attcgcacgt ggactggtcg gctggtgcgg 
cagggacggg tacgcgccgc gctggagtgt 
acgtcatcct cgaacacgac ccgctcgccg 
cagcacctgg gatcgtgccc tgggcgttgt 
aggccgaacg gctgcgcgag ctgtgcgagc 
tctcactggc cgccacgcgc acggcttggg 
gcgctacgtt gcgctccggg cttggcgttg 
ttgaggggag cgtcctggac ggcgaggtcg 
gggccggtat ggcagtcgac ctgctggacg 
agtgcgccac cgcgctgcgg aggtacgtgg 
cggagaactc cccaccgctg gaccgggtgg 
tggtgtcgct cgccgaggtg tggcgttcct 
acagtcaagg cgaaatcgcc gcggcctgcg 
ccaggcttgt cgcattgcgc agcagagcgt 
cgtcgctggc ctgccctgcg gatgaggtcg 
tggaagttgc ggcgatcaac ggcccgcgat 
tggacgaact gctggcagag tgcgctgaaa 
actacgcctc gcattcagcg cacgtggagg 
ccggggtgcg acaccgggac ggccaggtgc 
tggatccggc caggctggac ggcgagtatt 
tcgaacacgc cgtgcagggc ctggtcgagc 
cgcatccggt gctgaccacg gcggtcgagg 
ccgcggtagg taccttgcga cgtgactcgg 
ccgaggcgta cgtgcgcggc gccaccgtgg 
gacggctgga cctgccgacc tacccgttcc 
ctgcctccga cgaggctcgt gcggtctcgg 
tggcgcgcca agacctgaaa agcgtgtccg 
tgagcgcaac acttccagcc ctgtccgtct 
cagacggttg gcggtaccga gtcgactggg 
cgcgggaaac ctggctcctg gtcgttcccc 
ggctgacgga tgcgttgaac acgcgaggga 
cggcgaccag tggcgaactc gcaaccgaac 
aggcaatcct gtcgctcacc gcgttggacg 
cgagcgggat tgccttgctg ctgaacctgg 
ttcctctgtg gaccatcacg cgtggtgcgg 
gcccgatgca ggcgcaagca tggggtctgg 
gctggggtgg gctgatcgac ctgccggatt 
gcgaagcgct caccaacggc ttggcggaag 
tggcccggcg actggtaccc gccccggcga 
gagggagcgc gctgatcacg ggcggactcg 
tggccgaaat cggagccgag cgaatcgtgc 
gcgccgccga gctggaagcc gaactccggg 
gcgacgtgac cgatcgtgcc gagatgtccg 
tgttccacgc ggccggagtc ggtcggctgc 
tggccgaaat atgcgcggcg aaggtccgcg 
gcaccgatct cgatgccttc gtcctgttct 
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ccgacccgga tcagcccgga acctgctaca 14700 
gcgagttcga cgcgggattc ttcgacatca 14760 
agcagcggct gctgctggaa atctcctggg 14820 
ggtccttgcg ggggagccgc accggggtgt 14880 
ccggcgccca ccgggcagga gaaggtgtcg 14940 
gcgtcgcctc tggtcgggtt gcgtatgcgt 15000 
acacggcgtg ctcgtcgtcg ttggtggcgc 15060 
gcgagtgtga tctggcgctg gccggtggag 15120 
tggagttctc ccgtcagcgt ggtctcgcac 15180 
ctgcggatgg aaccggttgg ggtgagggtg 15240 
acgccaggcg gaacgggcat cgggtactgg 15300 
acggtgcgtc gaacggattg acggccccga 15360 
aggtgctcac gagtgcgggg ctgtcggcgt 15420 
cgggtacgcg gcttggtgat ccgatcgagg 15480 
atcgggaccg ggaccggccg ctgtggttgg 15540 
aggcggctgc gggcgtcgct ggtgtgatca 15600 
tgccgcgcac gttgcacgtg gacgagccga 15660 
tccgactcct gaccgagaac atccgctggc 15720 
cgtcgttcgg ggtaagcggt accaacgcac 15780 
tgaccgagaa cgaggaagca gcgcagtccc 15840 
ccgggcggtc gtcgacggcg ctgcgggccc 15900 
agaccgatcc cgaccccgtc gatgtcggtt 15960 
agcaccgagc ggtggtgctt ggtcgggaca 16020 
ttgccagcgg tgaaccagcg gtcgatgtcg 16080 
tcttcgtctt ccccggtcag ggctggcagt 16140 
cttcgccgac gttcgcgcgc cacatggacg 16200 
actggtcgtt ggtcgacgtg ctgcgcggag 16260 
acgtgctcca gcccgcgtcc ttcgcggtga 16320 
acggggtgag gccggcggcc gtcgtcggcc 16380 
cagccggggt gctgccgctg gaggatgcgg 16440 
tgaagggact ttcggggcgg ggtggcatgg 16500 
cggcattgtt cgcgggatcg ggcggccgtc 16560 
cggtcgtggt gtccggcgat ctggaagcgg 16620 
aggacatgcg tgcacgccgt atccccgtcg 16680 
tggttcggag cccggtgctg gcggccgccg 16740 
cgtggtggtc gacggtgatc ggcgactggg 16800 
ggtatcggaa cctccggcag ccggtccggt 16860 
ggggattcgg cctgttcatc gaaatgagtg 16920 
aaaccggtgc ggagtcggag accgccgtgg 16980 
gcggcctccg gaggttgttg cattcgctgg 17040 
actgggccgt ggcgttcggg ggcgcgggcc 17100 
agcgccagcg gtactggctg gacaagggag 17160 
acccggcggc gggctggttc tggcaagccg 17220 
atgccctcga tctcgacgcc gacgcaccgc 17280 
ggcaccgtca ggaacgagaa agggtcttgg 17340 
tacgggtggc cccgcagccg gtccggagaa 17400 
cgggcggcat cgaggaagcg ctggtcgaac 17460 
tcagcaccct gcgcctcgac gtgccaccgg 17520 
tccgcgccgc agccgacggt gacccggtga 17580 
agcgacccca ccccgaatgc aaggacgtcc 17640 
tcaaggcgct cggtgaagcc gacctcagaa 17700 
tcaaggcagg ccccgcagat cggctgctgc 17760 
ggcgagtagc cgcactcgaa caccccgagc 17820 
cgctggacgg cgacgtcctc acgaggctgg 17880 
accaactggc gattcgccag tcgggcgtgc 17940 
atcagcccgc tggacgtaag tggcgccccc 18000 
gcgcggtggg cgcacaggtg gcgaggtggt 18060 
tcaccagtcg acggggcaac caagcagcag 18120 
cccttggagc gcaagtgtcc atcgtggctt 18180 
cactactggc cgagttcgac gtcaccgcgg 18240 
tgccgttggc ggagaccgac cagaacggcc 18300 
gcgctcaggt gctggacgaa ctgtgcgaca 18360 
cctcgggtgc cggggtatgg ggcgggggcg 18420 
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gtcagggcgc ttacggcgcg gcgaacgcat 
cacgcggtct gccggcaacc tcgatctcct 
acggcgcggc gggcgaacac ctgcggcgac 
ccatcctggc tctgcaggaa gtacttgacc 
tggactggga ccgattcgtt cccacgttcg 
aagtgccggc ggcgagaaag gcgatgcccg 
cgttcgcccg caatctcgcg gagctgccgg 
tggtgtgcgc ccaggtggca accgtgctcg 
agcgggcgtt ccgcgcgctc gggttcgact 
tgaccaccgc caccgggttg cgcctgccga 
ccgccttggc cgctcacctg ctcgaggagc 
ccgctgccag cgcgcccgcg agtgacgaac 
ttccgggtgg cgcgcactcg ccggaagacc 
tgatcggcga gttcccctcc gaccggggct 
cttccaggcc tggaacgacg tatgcgcgga 
tcgatgccga cctgttcggc atcagcccac 
ggttggtgct cgaaatcgcc tgggaagccc 
tgaagggcag tggggtcggc acgtacatcg 
tgcggcagtt tcccgaggag gcggagggct 
tgtcgggtcg ggtcgcgtat tcgtttggtt 
cttgttcgtc gtcgttggtg gcgttgcatc 
gtgatctggc gttggccggt ggtgtgaccg 
tctcccgtca gcgcggtttg gcgccggatg 
acggcaccgg ctggggcgaa ggcgcgggcc 
accggaatgg gcatcgggtg ttggcggtgg 
cctcgaacgg actggcggcg ccgaacggtc 
tcgcgaatgc ggctctttcg gcgtccgatg 
ccaggctggg tgatccgatc gaggcgcagg 
agcgggatcg gcccttgtgg ctggggtcgg 
cggcgggtgt tgccggtgtg atcaagatgg 
cctcgctgca cgcggatgag cccacgtcgg 
tcctcgccga acaggtacct tggccggagt 
cgttcgggat cagcggcacc aacgcacatg 
atagtacagc ggagacggac aaaacagaat 
cctggttggt gtcgggaaag acgacggatt 
ctcaggtcga gtcccggccg gagcagcgtt 
gccgagccgc gctggatgaa cgcgctgtcg 
ctggactggc ggcgttggcc gccggtcagg 
cttctgctcg gttcgggttc gtgttctcgg 
gagcgctcta ctcgaagttt ccggtgttcg 
tggaggcaca tctgggggaa gaccgccggg 
agctgctgga tcagacgctg tgggcgcagt 
tggggctgct gggttcgtgg ggcgttcggc 
agttggccgc cgcgtttgcg gctggcgtgt 
ccgcgcgcgc ccggttgatg caagccctgc 
ctggtgaaga ccttgttcgg ccattgctgg 
cgctcaatgc ccccggttcg gtggtgttgt 
tcggccggct gaccgagctc cgagtccgga 
attcgcaccg gatggacccg atgttgggcg 
tcggtaagcc aacgacaccg cttgtgtcga 
aaatgagcac accagggtat tgggtgcgcc 
gtgtccaggc cctggcagcg cagggcatag 
cgctggcggc actggttcgg gagtgtgcga 
cgatcccact gatgcgcagg gagcgggacg 
atctccacac ccgtggtggt gaggtggact 
ggcagctcga gttgccaacg tatgccttcc 
cgcggccagc acgcgaccgc gcagacatcg 
ttgaccaagg cgatctggca acgttggtcg 
catgcgcatc gttgagcgat gtattgccgg 
accgttcgct cgtcgattcc tgccggtacc 
cggccccgaa gatttccggt acctggctgt 
tggtcacggc tttgacgagt tcactggtcg 
tgtccgaaga ggacccgcac cgcgaggacg 
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tcttggacac actcgccgaa caacgccgag 18480 
ggggcagttg ggccggcggc ggcatggccg 18540 
gcgggatacg tccgatgccg gcggcgtcgg 18600 
aggatgagac gtgcgtgtcg atcgctgatg 18660 
ccgcgactcg cgccacccgg ttgttcgacg 18720 
cgaatgggcc ggcagaacca ggcggctcgc 18780 
aagcccaacg acgccacgaa ctggtggatc 18840 
ggcacggcag tcgcgaggaa gtccagcccg 18900 
ccctcatggc ggtggatctg cgcaatcgtt 18960 
ccacaaccgt cttcgactac ccgaatccgg 19020 
tggtgggtga tgtcgcgtcg gctgcggtga 19080 
cgatcgcgat cgtcgcgatg agctgccggt 19140 
tgtggcggct ggtcgccgcc ggcacggagg 19200 
gggatgcgga aggcctttac gatccggatg 19260 
tggcgggatt cctctacgac gccggtgagt 19320 
gtgaggcgtt ggcgatggat ccgcagcagc 19380 
tcgaacgggc cggaatcgat ccgttgtcct 19440 
gcgctggaag ccgtgggtac gcgacggatg 19500 
acctgctgac gggtacctcg gccagtgtgc 19560 
tcgagggtcc tgcggtgacg gtggatacgg 19620 
tggcgtgcca gtcgttgcgt tcgggcgagt 19680 
tgatgtcgac gccggagatg ttcgtggagt 19740 
ggcggtgcaa gtcgttcgcg gagagcgcgg 19800 
tgttgttgct ggagcggttg tcggacgccc 19860 
ttcgtgggtc agcggtgaat caggacggcg 19920 
cgtcgcagca gcgggtgatc aaccaggcac 19980 
tggatgcggt ggaggcacat ggcaccggga 20040 
cattgatcgc aacgtatggg caggcccggg 20100 
tcaagtcgaa catcggtcat acgcaggccg 20160 
tgatggccat gcggcacggg cagctgcccg 20220 
aggtcgattg gtcgtcgggg gcggtccggc 20280 
ctgaccgtgt tcgtcgggtg ggggtttcgt 20340 
tgatcctcga acaagctacg aatgcgccag 20400 
ccggatctac tgtcgatatt ccggtcgttc 20460 
ccctgcgggg acaagccgaa cgagtcttgt 20520 
cgctggatgt tgcctactcg cttgcttctg 20580 
tgctgggtgc ggaccgcggt gagctggttg 20640 
aggcttctgg ggtgatcagc ggaactcgtg 20700 
ggcagggtgg tcagtggttg gggatgggca 20760 
ctgctgcgtt tgatgaggct tgcgccgagt 20820 
ttcgggatgt ggtcttcggt tccgatgcgc 20880 
cgggtctgtt cgcgctgcaa gccggcctct 20940 
cggatgtggt gatggggcat tcggtcgggg 21000 
tgtcgttgcg ggatgcggct cggttggtgg 21060 
cctctgacgg cgcgatgttg gcggtggctg 21120 
ccggtcggga ggagtccgtg agcgtcgccg 21180 
cgggcgatcg ggaggtgctg gccagcatcg 21240 
cgcggcgctt gcgggtctcc catgcttttc 21300 
agttcgccca gatcgccgag tctgcggagt 21360 
cgttgacggg tgagctcgac agagccgcgg 21420 
aggcgcgtga acccgtccgt ttcgccgacg 21480 
gcacggtcgt cgagctcggc ccggacggaa 21540 
ccgagtccga tcgggttggg cggatttcgt 21600 
agacccgttc ggtgatgaca gccctggcgc 21660 
ggcaggcgtt tttcgccggt accggcgcta 21720 
aacgacagca ctactggatc gagtccagtg 21780 
gcgaggtggc ggaacagttc tggaccgcgg 21840 
ccgctctgga tcttggggcg gacgacgaca 21900 
cgttgtcctc ctggcgaagc ggactccgca 21960 
gaatcagttg gcattcctct cgggaggtgc 22020 
tggtcgtgcc cggtgctgcg gatgacggat 22080 
gaggcggcgc cgaggtcgtc cggatcggcc 22140 
tcgcacagcg gctggccaat gcgctgacgg 22200 
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atgccggtca actcggtggc gtgctttcgc 
gattctcctg cttgccaact ggtttcgcgc 
aggccgacgt cgaggcgcct ttttgggcgg 
tacgcgtgtc tccggagcag gccctggtct 
acccggagtt ctggggtggc ttgatcgacc 
cccggttggc gggtgtgttg gcggatggtg 
gtgtgttcgt gcggcggttg gaacgcgctg 
ctcgggggac ggtgttggtg acgggtggta 
ggttggccgg tgccggggct gagcacgtgg 
cgggcgctgg ggaattgcgg gcggagctgg 
cctgcgacgt ggctgatcgt gacgcagtgg 
gtccgctgac tgcggtggta cacgccgccg 
tgggtttggc ggattttgca gcggtgttgt 
acgagttgct ggccgactcg gagctggatg 
tgtggggagc cggcggacag ggtgcgtatg 
ccgagcagcg tcgggcgagg ggattggtcg 
gtgacggcat ggccgccggc gaaaccggcg 
tggaaccgag cgcggcgctg ctggcacttc 
tcgtcgtggc cgatgtcgat tgggcacggt 
gcccgctgct ggacaccatc gacgaggccc 
cgggcacagg caaacccgtt gagctgacgc 
gcgacgatgc ggtattggat ctggtgcggg 
atgccacggc cctggcgcca tcgcggccgt 
cggtggagct gcgcaaccgg ctgaacaccg 
ttttcgacta ccccaatgcc gagtcgctgt 
cggagactac cgtggactcg gcccttgccg 
tgctcaccgg cgaagcgcgg gcacgggacc 
agaagtggaa cagcgcagct gaagtaccga 
cggcgacgca cgacgagata ttcgagttca 
ttcctgcgga acttcaagcg ccgaaatcgg 
ctcttcggct atctgaagaa ggtaactgcg 
gcggccgaga gccggagtca ggagccgatc 
ggcggcgtcg actctcccga agcgctctgg 
tcggagttcc ccgccgaccg gggctgggat 
caccagggaa cgtcgtacac gcgggccggc 
cccgccatgt tcgggatttc gccgcgtgag 
ttgctggagc tgtcctggga ggccctcgaa 
ggcagcaaga ccggtgtctt cggtggtgtc 
gagatgagcc gaaacgctgg gggttttgga 
ggtcgggttg cgtattcgtt tggttttgag 
tcgtcgtcgt tggtggccct gcatttggcg 
ctcgcgctgg ccggcggtgt gacggtgatg 
cgtcagcgtg gtttggctcc ggacgggcgg 
accgggtggg gtgagggtgc cggtctggtg 
aatgggcacg aggttctggc ggtggtgcgg 
aatggtttga ctgcgccgaa tggtccgtcg 
agtgcggggc tgtcggtttc cgatgtggat 
ttgggtgatc cgatcgaggc acaggccctg 
gatcggccgt tgtggttggg gtcggtcaag 
ggcgttgccg gcgtcatcaa gatggtcttg 
ttgcatgtgg atgagcccac gtcggcggtg 
acggagaaca cgccctggcc ggacagtggt 
gggatcagcg gcaccaacgc acatgtgatt 
ccggccgggc cggtcgaagg cgagcgggaa 
gtgctgtcgg gtaagacacc ggaggctgcg 
atcgaggacc ggccggggct gtcgccggtg 
gcggcgctgg atgaacgcgc agtggtgttg 
ttgagggcat tcgccgacgg ctgcgatgcg 
ggtggccgcg tcgggttcgt gttctcgggt 
gggctctact cggtgtttcc ggtgttcgcc 
gatgcacacc tgggccagga actgcgggtt 
ttgctggatc ggacggtgtg ggcgcagtcg 
cggctgctgg gttcgtgggg tgttcggccg 
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tgttggggct cgatgaatcg cctgctccgg 22260 

tgactgtgca gcttctgcgg gccttgcgga 22320 

tgacgcgcgg cggcgtcgcg ttggaagatg 22380 

gggggctgct gcgtgtcgcg ggactggagc 22440 

tgccatcgga ctgggacgac cgattgggtg 22500 

gcgaggatca agtcgccatt cgccgtggtg 22560 

gtgcgtcggg tgccgggtcg gtgtggcgtc 22620 

cgggcggttt gggggcgcat gttgcccggt 22680 

tgttgaccag ccgtcgagga gcggacgctc 22740 

aggcgctggg tgctcgggtg tcgattgtgc 22800 

ctggagtgtt ggcagggatc ggtggggagt 22860 

gggtcggcga ggcgggcgac gtagtggaga 22920 

cggcgaaggt gcgtggtgcg gcgaatctgg 22980 

cgtttgtgat gttctcctcg gtgtcggggg 23040 

cggctgcgaa cgcctacttg gatgcgttgg 23100 

ggaccgcggt tgcgtgggga ccgtgggccg 23160 

cacagctgca ccggatgggc ctggcgtcga 23220 

agggtgcatt ggaccgcgat gagacctccc 23280 

tcgccccagc cttcacctcg gcacgtcgac 23340 

gagccgcatt ggaaaccacc ggcgaacaag 23400 

aacgcctggc cggactgtcg cggaaggaac 23460 

cggagacggc ggctgtgctg ggacgcgacg 23520 

tccaggaact cggattcgac tccttgatgg 23580 

ccaccgggat ccagctgccc gccagcacga 23640 

cgcgtcacct ctgcgccgag cttttcccaa 23700 

agctcgatcg aatcgagcag cagctctcga 23760 

gaatcgcgac acgactgcga gccctccacg 23820 

ccggagccga tgtcctgagc acgctcgatt 23880 

tcgacaacga gctcgacctg tcctgagcag 23940 

gtggaaatca caatggccaa tgaagaaaag 24000 

gacctgcatc agacccggca gcgcctgctc 24060 

gcgatcgtct cggcgagctg ccgactgccc 24120 

caactcgtgc gcactggcac cgacgccatc 24180 

ctcggccggt tgtacgatcc cgacccgaac 24240 

ggtttcctcg caggagcggg cgatttcgac 24300 

gcgttggcga tggacccgca gcaacggttg 24360 

cgggcgggca tagacccgac atccctgcgc 24420 

acgccccagg agtacgggcc gtccttgcag 24480 

ctcaccgggc ggatggtgag tgtggcgtcg 24540 

ggtcctgcgg tgacggtgga tacggcgtgt 24600 

tgtcagtcgt tgcgttccgg cgaatgcgat 24660 

gcgacaccgg cgacgttcgt ggagttctcc 24720 

tgcaagtcgt tcgcggctgc cgcggatggc 24780 

ttgctggagc ggttgtcgga tgcgcggcgg 24840 

ggtagcgcgg tgaaccagga cggcgcgtcg 24900 

cagcagcggg tgatcaccca ggcgttggcg 24960 

gcggtcgagg cacatgggac cgggaccacg 25020 

atcgccacgt acgggcaggg ccgggagaag 25080 

tccaacatcg gtcacacgca ggcggccgct 25140 

gcgatgcggc acgggcagct gcccgccacg 25200 

gactggtcgg cgggttcggt ccggcttctc 25260 

cgtccttgcc gggtgggggt gtcgtcgttc 25320 

ctcgaacagt ctccagtcga gcagggcgaa 25380 

ccggatgtag ccgtccccgt ggtgccttgg 25440 

cgggcgcagg ccgaacgggt gcattcgcat 25500 

gatgtggcgt attcgctagg aatgacacgc 25560 

ggctcggacc gtgccgcgct cctgaccggg 25620 

cccgaagtgg tttcggggtc tgtggggctt 25680 

cagggtggtc agtggccggg gatgggccgg 25740 

gacgcgttcg acgaggcttg cgcggagttg 25800 

cgggatgtgg tgttcggttc gcaagcgtgg 25860 

ggtttgttcg cgttgcagat tggcttgctg 25920 

gatgtggtgt tggggcactc ggtgggtgag 25980 
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ctggctgcgg tgcatgcggc tggtgtgttg 
ggtcgcgccc ggttgatgca ggcgttgcct 
ggtgagtttc aggtcgatcc tctgctggat 
gtgaatggcc cggaatcggt tgtgctctct 
gatcggttgc acgatcaggg gtgccggacc 
tcgccccata tggagccgat gctggaggag 
cacgcaccgg aactgccgat catctcgacc 
atgggcactc ccgagtactg ggtgcgtcag 
gtccaggcgc ttgtcggtca gggtgtcggc 
ttgtcgacgt tggtcgagga gtgtgtggcg 
atgcgcaagg accgcgacga ggcgcgaacc 
cgtggtggtg aggtggactg gcggtcgttt 
ctgcccacct acgccttcca gcggcagcgg 
gacgtgaccg ccgccggatt ggccgaggcg 
ttggcagacg gcgaaggtgt ggtgctgacc 
ttgtccgatc accgggtgct gggcgaaatc 
gtgtggcacg tcggcgagcg cctcggttgt 
cccctgatcc tgccggatca tggagcggtc 
gaatccggag cccggtcggt ggcgctctac 
tggaagaagc acgcgacggg cgtgcttctc 
accgcatggc ccccggagaa tgcgaccgaa 
gaagggcacg gtttcgcgta cggaccggcc 
ggcggggagg tgttcgccga agtcgcattg 
ttcggcgtcc accccgcgtt gctggacgcg 
gtggtccaga gcgaagcgcg ggtgccgttc 
gaaagcgcgg tggtgcgggc gcgcctctcg 
gcagtggacc cggctggccg attcgtggcc 
tcccggcagc aggtgaggtc tggcgcgatc 
cggaaggcgt tgttgggaac aaccgccggc 
agttggccgg aatcggtgcg cgcaaccgca 
gccgtggact cggacgttcc tgccccgggt 
gaggtcgagg gtggatccct gccgtcgcgc 
ctcgtgcagt cgtggcttgc ggacgagcgg 
cgtgcagcgg tgtcggccga ctcggattcg 
tgggggttgt tgagttcagc ccagtcggag 
gacggcacac ctgagtcgtg gcaggcgttg 
ctggcacttc ggcgcggcgt ggcgctggtg 
gagggctcct ccccgcaact cgacacggac 
gcgttggggg gagtggttgc ccgtcacctg 
ttggcaggcc ggcgtggctg gaatgcgcct 
cgcgcgggcg ccgtggttga ggtggtggct 
cacgtgctgg ccgccattcc ggtcgactgg 
gtgctggccg acggagtgat cgggtccttg 
ccgaaggtga cgggggcatg gcatctgcac 
ttcgttcttt tctcttcctt ctccgggatt 
gcggcgaaca cgttcctgga tgcattggcg 
ttgtcgttgg cgtggggact gtgggcgcaa 
gcgtcggtgg agcggttggc gcggacgggc 
cgcctgttcg atgccgcgtt cgcgaaggac 
agggcgctgc tggtcgggaa cggacgatcg 
gttcctgttc gcggcggtgt ggcgaggaaa 
gcactgttgg gtttggtgcg ggagcacgtt 
gaggttgggg gcgaccgtgc tttccgtgat 
ttgcggaacc gccttgccgg ggtgctgggg 
tatccgacgc cgcgggcgct ggcgcgtttc 
tccacgtcga cgccggtgac cagggcagcg 
gggatgggat gtcgttttcc gggtggggtg 
gccggcggcg tggatgcggt ggctgggttc 
ttgtacgatc ctgatcccga tcgtctcggg 
cgggacgcgg cggagttcga tgctgacatg 
atggatccgc agcagcggtt gctgctggag 
atcgatccgt tctcgttgca cggcagccgg 
gactatgggg cccgattcat taccagagca 
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tcgttgtcgg aggccgcgcg gttggtggcg 26040 

tctggtggtg ccatgctcgc ggtcgctacg 26100 

ggggtgcggg accggatcgg tatcgcggcg 26160 

ggtgaccgcg agctgctcac cgagatcgct 26220 

cggtggttgc gggtgtcgca tgctttccat 26280 

ttcgcccaga tctcccgagg ccgcgaatat 26340 

ctgatcggtg agctggacgg tggtcgagtg 26400 

gtgcgtgagc ccgtccgttt cgccgagggt 26460 

acgattgtcg aattgggtcc ggacggggcg 26520 

gaatccgggc gggtggccgg gatcccgctg 26580 

gtgctggcag ctttggcgca gatccacacc 26640 

ttcgccggta ccggggcgaa gcaagtcgac 26700 

tactggctgg catccaccgg gcgtgcgggt 26760 

gaccatccgc tgctcggtgc ggtggttgcg 26820 

ggtcggttga cagcgggttc gcatccgtgg 26880 

gtcgtccccg gcaccgcgat cgtcgagctg 26940 

ggccgggtgg aagaactggc tttggaagcg 27000 

caggttcagg tgctggtggg accgcccggg 27060 

tcctgtcctg gcgaggcgat cgaacccgag 27120 

ccacccgtgg ccgccgagaa ccatgagctg 27180 

atcgatgcag acggggtcta cgcattcctt 27240 

tttagatgtc tgcgcggtgc ctggcgacga 27300 

ccggatgaca tgcaggcggg ggtcgatcga 27360 

gttctgcatg ccgccgcagc cgagacgtcg 27420 

tcgtggcgtg gggtggaact tcgcgccact 27480 

ttgacttcgg atgacgaact gtcgttggtc 27540 

acggttgatt cgctggtgac ccgaccgatc 27600 

ggtgattgcc tgttcgaggt ggagtggcac 27660 

gacgaccttg ccatcgtcgg tgacggtccc 27720 

cggttcgcga ccctggatga gttccgtgcg 27780 

tcggtgttgg tcgcagctat gtcggccgaa 27840 

gcccaagagt cgacctccga tctgctggct 27900 

ttcgccgaat cccagctcgt ggtcgtcacg 27960 

gacgtcgcgg acctggtggg tgcgtcgtcg 28020 

aacccgggtc gcttcgtgct ggtggacgtg 28080 

ccggccgccg tgcgagcagg agaaccgcag 28140 

cctcggttgg cgcgactcac ggtgcgcgag 28200 

gggaccgtcc tcatcacggg tggcaccggt 28260 

gtggaggagc acgggattcg gcgtttggtg 28320 

ggagtccacg agttggtgga tgagctggcg 28380 

tgcgatgtgg ctgaccgcac cgatctggag 28440 

ccgctgcggg ggatcgtgca taccgctggg 28500 

tcggcggcgg atgtgggcac ggtgtttgcc 28560 

gagttgaccc gcgatctgga tctgtcgttc 28620 

gcgggtgccg cagggcaggc caactacgcg 28680 

cgttatcgcc gggcgcgtgg gctgcctggg 28740 

cccagcggta tgacgagtgg cttggacgcg 28800 

atcgcagaac tttccacgga ggatggactc 28860 

cgggcttgcg tcgttgccgc tcgattggac 28920 

cacgcgattc cggcgctgtt gagcgcgttg 28980 

acagccaatt ctcaggccgc ggatgaggac 29040 

tcggccgtgc tgggttattc gggtgcggtc 29100 

ctgggttttg attcgttgtc tggcgtggag 29160 

gtgcggttgc cggcgactgc ggtgttcgac 29220 

ctgcatcagg aactggcagg cgaggtcgcg 29280 

agtgccgaag aggatcttgt tgcgattgtc 29340 

tcgtcgccgg aggagctttg gcggctggtg 29400 

ccagacgatc gcggctggga tctcgcggcg 29460 

acctcgtatg tgtgtgaggg cgggtttctg 29520 

ttcggcatca gcccgcgtga ggcgttggcg 29580 

gtcgcctggg aaaccttgga gcgggctggg 29640 

accggtgtgt tcgcgggctt gatgtaccac 29700 

ccggagggct tcgaagggca cctcgggacg 29760 
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ggcaatgcgg ggagcgtgct gtcgggtcgg 
gcggtgacgg tggatacggc gtgttcgtcg 
gcactgcggg ccggtgagtg cgaattcgcc 
ccgacgacgt tcgtggagtt ctcccgtcaa 
tcgttcgcgg cggccgcgga tggcaccggg 
gagcggttgt cggatgcccg gcgcaatggg 
gcggtgaacc aggacggcgc gtcgaatggc 
agggtgatca cccaggcact cacgagtgcc 
gaggcgcatg ggacgggcac gcggcttggt 
acgtacggcc gggatcgtga tcccggtcgg 
attggtcaca cccaggcggc ggcgggtgtc 
cggcaggggg agctgccgcg cacgttgcac 
tctgcgggca cggtccaact cctcacggag 
cgccgggcgg gcgtgtcatc gttcgggatc 
caacctccgc gagagtcgca gcgctcaaca 
ccggtggtgc cgtggatggt gtcgggcaaa 
gcattgatgt cctacttgag caatcgcgtt 
cttgcggtga cccgtccggc gttggaccac 
gcgttgctgc cgggcttgaa agcgctggcc 
ggcactcgtg ccgctgggcc ggtcggattc 
gggatgggaa gcgggctcca ctcggcgttt 
tgctgcgagc tggatgcgca tctcgggcag 
tcggatacgc aacttctgga ccagaccttg 
gtcggactct gggagttgtt gggttcgtgg 
tcggtcggtg agctggcggc ggcgttcgcg 
cggctggtgg cgggccgtgc ccggttgatg 
gctgcggctg ctggagagga gcagctgcgc 
gggatcgccg cggtcaacgc tcccgggtcg 
gatgacattg ccggtcggct ggacgggcaa 
catgcgtttc attcgcatcg gatggatccg 
agcgtggact accggtcgtc agggctgccg 
gaggtcggca tgccggctac gccggagtat 
ttcgccgacg gtgttgctgc gctcgcggct 
ccggatgggg tgttgtcggc gctggtgcag 
cgggtggccg cggttccgct catgcgcagc 
gcattggcgc agatccatgt gcgtggtgct 
accggggcaa agcaggtcga gctgcccacg 
gactcaccat ccgaaccggt cgggcaatcc 
gaactcgtcg agcaggaaga tgtcagcgcg 
cacgacgtgc aggcgtccct ggaatcggtg 
atccgcaacg aatccctggt gcaccagtgg 
gatttgccag acccctcgtt gtcggggaca 
gcgagtcggc aagttctgcg tttcaacgag 
ctgttcgagc tcgccgggca cgacgaggaa 
gttgcgtcag ggggaataag cggcgtgttg 
tcgccgaacg ctgctttgcc gaatggcgcg 
cgggccgcgg atgtgtcggc gccattgtgg 
gatgtgccgg tgaacccggg gcaggcgctg 
gagcatccgg cctggtgggg tggcctggtc 
cgagaacgct tgtcggtcgt gttggcaggt 
ggtggtgtgt tcgtgcggcg gttggaacgc 
tggcgtcctc gggggacggt gttggtgacg 
gcccggtggt tggcgggtgc cggggctgag 
gcggctccgg gcgctggaga tttgcgggcg 
atcacggcct gcgacgtggc cgatcgtgac 
gatgattgcc cgctgaccgc ggtgatgcat 
gcgtcgatgt gtttgaccga cttcgttggg 
aatctcgatg agttgctcgc cgatgtcgag 
tcgggtgtgt ggggtgctgg cgggcagggc 
gcgttggcgc agcagcgtcg ggcaaggggg 
tgggccggtg acggaatggc cgcaggtgaa 
gtgccaatgg ctgcggatcg ggcgttgctg 
acatccctgg tcgtggccga tatggcgtgg 
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gttgcgtatt cgtttggttt cgagggtcct 29820 

tcgttggtgg cgttacacct ggcgggtcaa 29880 

cttgccggtg gcgtcacggt gatgtcgacg 29940 

cggggtctgg ctccggatgg gcggtgcaag 30000 

tggggcgagg gtgccggtct ggtgttgctg 30060 

cacgaggttc tggcggtggt gcggggtagc 30120 

ttgactgcgc caaatggtcc gtcacagcaa 30180 

gggctgtccg tgtccgacgt ggatgctgtg 30240 

gatccgatcg aggcgcaggc gttgatcgct 30300 

ccgttgtggc tggggtcggt gaagtcgaat 30360 

gctggtgtga tcaagatggt gatggcgatg 30420 

gtggacgagc cctccgcgca ggtggactgg 30480 

aacacgccct ggcccgacag cggtcgtctt 30540 

agtggcacca acgcgcacct gatccttgaa 30600 

gagccggatt cgggttctgt ccgcgatttt 30660 

acacccgaag cgctatccgc ccaggcagat 30720 

gatgcttccc cgcgagatat cggttattcg 30780 

cgcgctgtcg tgctgggtgc ggatcgtgcc 30840 

gttagtaatg acgctgccga ggtgatcacc 30900 

gtgttctccg gtcaaggtgg tcagtggccc 30960 

ccggtgttcg ccgacgcgtt tgacgaagcc 31020 

atggcccggc tacgagatgt gttgtccggt 31080 

tgggcgcagc cgggcctgtt cgcgttgcaa 31140 

ggtgtccggc ccgctgtggt gctgggccac 31200 

gctggagtgt tgtcgttgcg ggatgcggct 31260 

caagccctgc caactggcgg tgccatgctc 31320 

ccgttgctgg ccgactgcgg tgatcgtgtg 31380 

gtggtgctct ccggtgatcg ggatgtgctc 31440 

gggatccggt ccaggtggtt gcgggtttcg 31500 

atgctggcgg agttcaccga aatcgcccgg 31560 

atcgtgtcga cgttgacggg tgagctcgat 31620 

tgggtgcgcc aggtgcgaga acccgtccgc 31680 

cacggtgtga gcaccgtcgt cgaggtcggt 31740 

gagtgcgcgg ccggatccga tcagggcgga 31800 

aatcgcgacg aggcgcacac ggtgacaacg 31860 

gaggtggact ggcggtcgtt tttcgccggt 31920 

tatgccttcc aacgacagcg gtactggctt 31980 

gccgatcccg cgcgccagtc gggcttctgg 32040 

ctcagcgccg ctctgcacat taccggcgat 32100 

gttccggtcc tctcctcctg gcatcgccgg 32160 

cggtaccgga tttcctggca tgagcgggca 32220 

tggctcgtcg tcgtgccgga ggggtggtcg 32280 

atgttcgagg aacggggttg cccggcagtt 32340 

gccctggcgc aacgattccg ctcgttgcct 32400 

tccttgctgg cgctggatga atcgccgtcc 32460 

ctgaactcgt tggtactgct gcgagctctg 32520 

ttggcgacgt gtggtggtgt cgcggtcggg 32580 

gtgtggggac tgggtcgcgt cgtcggtctg 32640 

gacgtgccgt gcttgctcga tgaggacgct 32700 

cttggcgagg acgagatcgc ggtacgtccc 32760 

gctggtgcgg cgtcgggtgc cgggtcggtg 32820 

ggtggtacgg gcggtttggg ggcgcatgtt 32880 

catgtggtgt tgaccagccg tcgaggcgcg 32940 

gagctggagg cgctgggcgc tcgggtttcg 33000 

gctttggccg aagtgttggc gaccattccg 33060 

gcggcggggg tcgttgaagt cggcgacgtg 33120 

gtgctgtcgg cgaaggcagg tggtgcggcg 33180 

ctggatgcct tcgtgctgtt ctcatccgtc 33240 

gcttatgcgg cggcgaatgc ctacttggat 33300 

ttggtgggga ctgcggttgc gtggggcccg 33360 

ggcggtgcac agctgcgccg ggccggcctg 33420 

gcacttcagg gcgcattgga tcgtgacgag 33480 

gagaggttcg ccccggtgtt cgccatgtcc 33540 
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cgtcggcgtc cgctgctcga cgagctgccc 
aacaccactg atgctgcgga ctcggccgtc 
gccgaacgcc gccgcgcgat gctggacctg 
cacaacgggt ctgacccagt tggtcccgac 
ctgatggccg tcgaactgcg caacaggttg 
acgttgatct tcgattatcc gagcccatcc 
gtgggagcgc agcccgcgac caccgtcgtg 
gtcgtggtcg cgatgggatg ccggtatccg 
cagctggttt ctgcgggacg tgatgcggta 
tgcaacacgt tgttcgaccc ggatccggat 
gccttcctga ccggtgctga tcggttcgac 
gcgcgcgcaa tggatccgca gcagaggttg 
cgagcaggaa tcgctccgct gtcgttgcgg 
aatgggcagg accacggtgc gaaagtggct 
accggaaacg ccgcgagtgt cctggccggc 
cctgcggtgg cggtggatac cgcgtgttcg 
cagtcgctgc gttcgggtga gtgtgatatg 
acacccctgg ctttcctcga gttctctcgt 
aagtcgtttg cggccgctgc ggatggcacc 
ctggagcggt tgtcggatgc tcgtcggaat 
tctgcggtga atcaggatgg tgcgtcgaat 
cagcgggtga ttcggcaggc cctcgcgaat 
gtggaggcgc acgggaccgg taccgggctc 
gcgacatatg ggcaggagcg ggatcctgag 
aacatcggcc acacgcaggc ggcggccggt 
atgcggcacg gggagttgcc tgcgacgttg 
tggtctgccg gggccgttcg gctcctcacc 
cctcgtcgag cgggggtgtc gtcgttcggg 
gaacaaccac cgtcggaacc agcggagatc 
ccagcggtga tcccgtggat gttgtcggct 
gctgcgctgc aggcccggct ggaccggggt 
tcactcgcga ccactcgttc tgtgctggac 
gaggcactgc tgtccaggct ggcagcgctc 
acgggctctg cgaattccgg tggccgcatc 
tggctgggga tgggaaaggc gttgtgcgcg 
gaagcctgcg acgcgctaag cgcacacctg 
gctgatgagc agatgctcga ccggacgctg 
gtcggcctcc tgggattgct gaggtcgtgg 
tcggtcggcg agttggctgc ggcgcacgcg 
cggttggttg cggctcgggc ccacctgatg 
gcggtcgcca ccagcgaggc ggcggtcgga 
agcatcgctg cgatcaacgg ccccgagtcg 
gtggagctcg caggcgaatt cgatgcccga 
catgctttcc actcgcaccg gatggaaccg 
tgcgtcgagt tcggtgaacc ggtggtgccg 
accaccggac tgatgtgcgc ggccgactac 
ttcggagacg gtgtccgggc gctcgtcggc 
ccggacgggg cgttgtcggc cctggtcgag 
agggtggcgg cgatcccgct gatgcgcagg 
gccctggcgc acgtgcacgt ccgcggtggt 
accggcgccc gcaccgtcga gttgcccacc 
gccgggcaag cggacgggcg cggcggcgat 
tgggagttgg tcgagcgcgc cgatccggaa 
gaccagccct tccgggaggt gctgcccgtt 
gaggccctcg cggattcctg gcgctaccag 
gcagccgccc tccggggcgt gtggctggtg 
ccggcggtcg tcatcgacgc gctgatcgcg 
accgagcagg acctccaacg cagtgcgctt 
cgcaccgagg tgacgggtgt gttgtctctg 
catccgcacc tgtcccgtgg tgtcgccgct 
gcgggtgttt ccgccccgct gtggctggcc 
gacggtccgg ccgatccgga ccacggcttg 
gaacatccgc agtggtgggg tggcctgatc 



LeA33955-US 
gaagcacagc aggcgttggc ggatgcggag 33600 
ccgctaccgc ggctcgcggg catggcagcc 33660 
gtgctggcgg aggcctcgat tgtgttggga 33720 
cgggcgttcc aggagctcgg atttgattcg 33780 
ggcgaggcaa caggattgag tctgccggcc 33840 
gcgctggctg agcagctggt cggcgagctg 33900 
gccggggccg atccagtgga tgatccggtt 33960 
ggcgacgtct gctcgcccga ggagctgtgg 34020 
tcgacgttcc ccgtcgatcg gggttgggac 34080 
cgggcaggca gtacctatgt gcgagaaggt 34140 
gccgggttct tcggcatcag ccctcgcgag 34200 
ttgctcgaag tggcgtggga ggttttcgaa 34260 
ggtagcagga ccggtgtgtt cgcggggacc 34320 
gccgcgccgg aggcggcggg tcacctcctg 34380 
cggctttcct acacgttcgg ccttgagggg 34440 
tcgtcgttgg tggcgttgca tttggcgtgc 34500 
gcgttggcag gtggtgtgac ggtgatgtcg 34560 
cagcgcggtt tggcgccaga tggtcggtgc 34620 
gggtggggtg agggtgccgg cctggtgttg 34680 
ggtcaccggg tgttggccgt ggttcgcggg 34740 
ggcctgactg cgccgaatgg tccgtcgcag 34800 
gcggggctgt cggcgtccga tgtggatgtc 34860 
ggggatccga tcgaggcgca ggcgctgatc 34920 
cgggccctgt ggctggggtc gatcaagtcc 34980 
gtggcggggg tcatcaagat ggtgcaggcc 35040 
cacgtggaca agcccactcc acaggtggac 35100 
gggaacacgc cctggcccga gagcggccgt 35160 
atcagcggca ccaacgcaca cctcatcctc 35220 
gaccaatcgg atcggcgggt cactgcgcat 35280 
aggagtctcg cagcgctgca ggcccaagcg 35340 
cctggcgctt ctccgctgga tttggggtat 35400 
gaacgcgccg tcgtgtgggg tgccgatcgg 35460 
gccgatggcc ggacggcgcc gggggtgata 35520 
ggattcgttt tttccggtca gggcagtcag 35580 
gctttcccgg cgttcgcgga cgccttcgag 35640 
ggcgcggacg ttcggggtgt gctgttcggt 35700 
tgggcgcagt cggggatctt cgcggttcaa 35760 
ggcgtgcggc cggccgcggt gctggggcac 35820 
gctggtgtgt tgtccttgcc ggacgctgca 35880 
caggcattgc ccaccggcgg cgcaatgctc 35940 
ccgctgcttt ccggggtgtg cgatcgggtc 36000 
gtagtgctct ccggcgaccg cgatgtgctc 36060 
gggcttagga ccaaatggtt gcgggtctcc 36120 
attctggacg agtacgcgga aaccgccagg 36180 
atcgtctccg ccgcgaccgg tgcgctggac 36240 
tggacgcgcc aagtgcgtga tcctgtccgc 36300 
caaggcgtgg acacgatcgt cgagttcggc 36360 
cagtgcttgg ccgggtccga ccaggctggg 36420 
gaccgcgatg aggtcgagac cgcggtggcg 36480 
gcggtggact ggtcggcttg cttcgccggc 36540 
tacgccttcc aacgccagcg gtactggctg 36600 
gtggttgccg acccggtcga cgcgcgcttc 36660 
ccgttggtgg atgaactctg catcgaccgg 36720 
ctggcttcct ggcgcgagaa acaacgccag 36780 
gtgcgctgga ggtccgtcga ggtgccgtcc 36840 
gtgcttccag ctgacgtgcc ccgagatcaa 36900 
cgcggcgccg aggtcgcggt cctggaattg 36960 
gtggacaagg tgcgcgccgt cattgcggac 37020 
ttggcgatgg acggcatgcc ctgcgcggcg 37080 
accgtgatcc tgacgcaggt gttgggcgat 37140 
acgaccggtg gcgtcgaggc cgggaccgag 37200 
atctgggggc tcggcagggt cgtcggcctt 37260 
gaccttccgg agacactgga cgagacgtcc 37320 
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cggaacgggt tggtggccgc actcgccggg 
tcatccgggt tgttcgttcg cagagtggtg 
tggcgtagcc ggggaacggt cctcatcacg 
gcacgatggc tggcccggcg gggagctgag 
gaagctcccg gcgcagcgga cctaggggcc 
gtcttggcct gcgatgtgac ggaccgcgac 
acggagtatc cgctgtcggc ggtcgtgcac 
gccgagacga ccttggcgca gttcgccgac 
aacctggacc ggctgcttgg cgggcaaccg 
tcgggagttt ggggagccgg cggccaagga 
gcccttgccg agcgccgacg ggcttgcggg 
tgggcgggtg cgggcatggc cgttcaggaa 
ctggtaccga tggaaccgca gtcggccctc 
gaaaccgcca tcaccgtcgc agatgtggac 
gcccgcccgc gaccactgtt ggaagagatc 
gagaagcacg gtgccggcga gctggggcag 
ggacacctgc tgctggaggt ggtgctggcg 
gcggaggctg tgcaacccga tcggaccttc 
gtagagctgc gcaacaggtt gaacgcggtg 
ttcgaccacc cgacgccgct ggcgttgtcc 
ccggacaacg gcatcgaatc gctgctcgcc 
caagggcctt cgatcccact ggaagaccag 
ctcgccaagt gggacggggc gcgtgacggc 
acggcggcca cggacgacga aatcttcgac 
ccctttcctc gcctcagctc ccctgattac 
aaaagctccg cgagtacctc aagcgtgtcg 
tgcacgagtt ggagcgccag gagcacgacc 
atcccggtgg cgtctccact ccggaggagc 
cgatcgcgaa cttccccgaa gaccgtggct 
cgggccgagc cgggacctcc tacgtccgcg 
tcgatgccgg gctcttcggg atcagtccgc 
ggttgctgct ggagatctcg tgggaggtgt 
tgcggggtac caagaccggt gtgttcgcgg 
ttcgcaagac ccccgcggag ttcgagggtt 
catccggccg ggtggcttac accttcgggt 
cctgctcgtc gtccctggtg gcgctgcacc 
gcgacctggc cctggccggt ggcatttcgg 
tcagccggca acgcgcactc gcctcggatg 
acggcaccgg ctggggcgag ggcgccggaa 
gacgaaacgg ccacccggtg ctggcggcgg 
cgtccaacgg cctgaccgcg cccagcggtc 
tggcgaacgc cgggttgtcg cccgccgagg 
cggccttggg cgacccgatc gaggcgcagg 
cggcggatca tccgctgctg ctgggttccc 
ccgccggtgt ggccggggtg atcaagtcgg 
gcagcctgca catcgaccag ccatcgcagc 
tgctcacgga cagcgttgac tggccggatc 
cgttcggcat gagcggtacc aacgcacacc 
tctcgggcag taccgagccg accggggcat 
cggcattgcg cgagcaggct gccgagttgc 
gactggggga cgtcgggtac tcgctggcca 
tcgtggttgc cgacgatcgg gactctttcg 
ttccggcagc caacgtggtg cagggcgcgg 
tccccggcca gggctcgcat tggcagggga 
tgttccggcg gaagctggcg gaatgcgcgg 
tgctcggcgt ccttcgcggt gatcccgatg 
agctcgcgct gttcgccatg atggtgtcgc 
agcccgccgc ggtggtcggt cattcccagg 
ctttgtcctt gactgatgcg gtgcgcatca 
tgaccgggaa gggaggcatg ctcgcgattg 
tcgcaggcct gccggagctg accgttgcgg 
ccggcgaacc gtcggctctg gagcgtctgc 
cccggcgggt gggaattgat tacgcctcgc 
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acggcggccg aagatcagct cgccgtgcgt 37380 
cgcgcagcgc ggaacccccg gtcagagaca 37440 
ggcggaacag gcgcgctcgg tgccgaggtc 37500 
cacctggtgt tgatcagtcg ccgcggcccg 37560 
gagctgactg aactcggcgt gaaagtcaca 37620 
gagctggcgg cggtgctggc ggccgttccc 37680 
accgccggcg tcgggacgcc tgcgaacctg 37740 
gtgttgtcgg ccaaggtcgt cggcgcggcg 37800 
ttggacgcct tcgtgctgtt ctcctcgatc 37860 
gcctattcgg ccgccaatgc gtatctcgat 37920 
cggccggcga cgtgcatcgc ctggggtccg 37980 
ggtaacgagg cgcatctccg ccgaaggggc 38040 
ttcgcgctgc aacaggccct gtcccaacga 38100 
tgggagcgat tcgccgcctc tttcaccgcg 38160 
gtggatctac ggcccgacac cgagaccgag 38220 
cagctggccg cactgccgcc cgctgagcgc 38280 
gaaaccgcca gcaccctggg gcacgattcg 38340 
gccgaactgg gcttcgattc gctgaccgcg 38400 
accgggcttc gcctgccgcc gacgctggtt 38460 
gaacagttgg ttccggccct ggtcgcggag 38520 
gagctcgaca ggctggatac cacgttggcg 38580 
gccaaggtgg cggagcgctt gcacgcactc 38640 
acggccagag cgacgtcacc ccaatcgctg 38700 
ctcatcgacc ggaagttccg gcgctgaccg 38760 
tggaacggtg tatttcgatg gccaatgaag 38820 
tcgtcgaact ggaagaggcg cacgaacgcc 38880 
ccatcgcgat cgtgtcgatg ggatgtcgtt 38940 
tgtggcgact ggtcgtcgac ggaggagacg 39000 
ggaatctgga cgagctgttc gatcctgatc 39060 
agggtggttt cctgcgcggg gtcgcggact 39120 
gcgaggcaca ggcgatggac ccgcaacagc 39180 
tcgagcgcgc cggcattgac ccgttttctt 39240 
gcctgatcta ccacgactac gcgtcgcggt 39300 
acttcgccac cggcaacgcg ggcagcgtcg 39360 
tagagggccc ggcggtcacc gtggacaccg 39420 
tggcctgcca gtccctgcgg ctgggcgaat 39480 
tgatggccac gccgggagcc ttcgtcgagt 39540 
gccggtgcaa gcccttcgcg gatgccgccg 39600 
tgctgctgct ggaacggctg tcggacgcac 39660 
tggtcggttc cgcgatcaac caggacggga 39720 
ccgcacagca gcgagtgatc cgccaagccc 39780 
tcgatgtggt cgaggcgcac ggcacgggca 39840 
ccctgatcgc cacctacggg gcgaaccggt 39900 
tcaagtcgaa catcggccac acccaggctg 39960 
tcctggccat caggcaccgg gagatgcccc 40020 
acgtggactg gtcggcgggc gcggtgcggc 40080 
tcggcaggcc gcgccgagca ggggtgtcct 40140 
tgatcgtcga ggaagtatcc gacgagccgg 40200 
ttccctggcc gctgtccggc aagacggaga 40260 
tctccgtagt gaccgagcac ccggagccgg 40320 
ccggtcgcgc tgcgatggag caccgggctg 40380 
tcgccggact gacggcgttg gctgcgggcg 40440 
ccgactgcaa gggaaaggtc gcgttcgtgt 40500 
tggcgaggga actgtccgaa tcctcgccgg 40560 
cggctacggc cccttacgtg gactggtcgc 40620 
cacccgcgct ggatcgcgac gacgtgattc 40680 
tggccgaact gtggcgttcg tgcggagtgg 40740 
gcgagatcgc cgccgcccat gtggcaggcg 40800 
tcgctgcccg ctgcgatgcg gtgtcggcgc 40860 
ccttgccgga aagcgcggtg gtgaagcgaa 40920 
cggtcaacgg acccggctcc actgtcgttt 40980 
agaccgaact gaccgcggaa aacgtgcaga 41040 
attcgccgca gatcgcgcag gtccagggcc 41100 
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ggcttctgga ccggctgggc gaagtcgggt 
cggtcaccgg cgagcggacg gacaccggcc 
ttcggcagcc cgtccgcttc cagcagaccg 
tcttcgtcga ggtgagcccg cacccgctgc 
ccgcggacgc gggcggggtg gtggtcggtt 
gctggctgac ttcgctggcc gagtgccagg 
tattcctcaa caccggagcc cgacgcgtgc 
ggtactggtt ggagtccgcc gagtacgacg 
ccgccgagca tcccctgctc ggggctgcgg 
tgaccggcaa gctgtcggtc aagacccagc 
cgatcctgct gcccggcacc gcgttcgtgg 
ggtgcgatct gatcgaggag ttgtccctga 
cggtgcaggt gcagatcgcg gttggcggtc 
tgcattcctg tcgagacgac gccgtgccgc 
cgttgacctc cagcgatcac caggacgccg 
acgatgctgt cgcggttccg ctggacagct 
atttcggccc ggcgttccag gggttgcagg 
ccgaggtcgg cctgcccacc gcacaccgcg 
ctctgctgga tgcggcactg caggcgctgg 
ggctcccgtt cgcgtggcaa ggtgtgtccc 
tgcacctcgt tccggcgggc gcgaatgcgg 
aagccgtgct ctccatcgat tcgctggtgc 
cggcccgtgc gatggaacac gagtccctgt 
gcgctgccaa gccggtctcc tgggcagtga 
gctcggcact tggcacggaa ctccaccccg 
acgtcgtggt ggtgccatgc ggtgcgtctc 
gtgccgcgac acaacgcatg cttgacctga 
ccggatctcg cctggtggtt gtgacgtgtg 
tttccgacct ggtgcatgct gcgtcgtggg 
cggaccgatt cgtgttggtc gatgtggacg 
cggccgtgcg ttccggagaa ccgcagctgg 
gcctggcgcg atgtgttgcc gccgaggaca 
cggtgttgat ttccggcggt acgggcctgc 
cggagcgcgg tgtccgccgc ctggtgctcg 
tcaccgacct ggtggatgag ttggtgggcc 
atgtcgggga tcgggcccag ttggaccggc 
tgcgcggagt ggtgcatgcg gccggggcac 
cagagcacgt ggcaaaggtg ttcggcccga 
tgactcttga tctggatctc tcgttcttcg 
gggctgcggg tcagggaaac tacgcggcgg 
accggcggac ggcggggctg cctgcggtgt 
gcgggatgac cggagcgctc gatgcggcgg 
cgatgtccgc gccggacggg ttgcggctgt 
cgcttctggt tccggtccac gtcgacctga 
gtgtgcctgc gttgttgcgc gacctggtgc 
agtcggcgga cgtcaacggt ctggttggtc 
aaacccagct gttgggtttg gtgcgcgagc 
cggtcgaggt cggggccgat cgtgctttcc 
tggagtttcg gaaccggctt ggcggggtgc 
tcgactatcc gacaccgcgg gcgttggttc 
tggaggctcc gactcccgca ccggcggctg 
tgatcgtggg gatgggctgt cgttatccgg 
gtttggtggc cgggggcttg gatgcggtgg 
aggcggggtt gttcgatccg gatcccgatc 
gcttcctgcg agatgcggca gagttcgatg 
cgttggcgat ggatccgcag cagcggttgc 
gggcggggat tgatccgctt tcgttgcggg 
tgcaccacga ctacggcgcg cggttcatca 
taggtaatgg cagcgcggga ggcgtgtttt 
agggtcctgc ggtgacggtg gatacggcgt 
cgggtcaagc actgcggtct ggtgagtgtg 
tggccacgcc ggggatgttc gtggagtttt 
ggtgcaagtc gtttgcggcg gctgcggatg 
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ccgaacctgc tgagatcgct ttctactcga 41160 

gactcgacgc cgactactgg taccagaacc 41220 

tcgcccggat ggcagatcag ggctatcggt 41280 

tcaccgccgg aatccaggaa acgctggaag 41340 

cgctgcggcg tggcgagggc ggctcccggc 41400 

tgcgcggact gccggtgaat tgggaacagg 41460 

cgctgccgac ctacccgttc cagcggcagc 41520 

cgggcgatct cggttcggtg ggcttgctct 41580 

tgacgctggc cgatgcgggc gggttcctgc 41640 

cctggttggc cgaccacgtg gtcggcgggg 41700 

aaatgctgat acgcgccgcg gaccaggtcg 41760 

cgactccgct ggttttgccc gcgaccggtg 41820 

cggacgaggc cgggcgccgc tcggtccgcg 41880 

aggactcgtg gacctgccac gcgaccggca 41940 

gccagggccc cgatgggatt tggccgccca 42000 

tctacgcccg cgcagctgag cggggcttcg 42060 

cggcttggaa gcgcggagac gagatcttcg 42120 

aagacgccgg caggttcgga atccaccctg 42180 

gcgcagccga agaggatccg gacgagggat 42240 

tcaaagcgac gggcgcactt tcccttcggg 42300 

tgtcggtgtt cacgaccgac acgactggcc 42360 

tgcgccagat ttcggacaag cagttggcag 42420 

tccgggtcga ctggaagcga atctcgcccg 42480 

tcggcaatga cgaactcgcc cgagcctgcg 42540 

acctgaccgg gttggctgac ccgcccccgg 42600 

gccaggactt ggacgttgct tccgaggcac 42660 

tccaggattg gttggcggcg gcgcgattcg 42720 

gtgcggcgtc gacaggtccc gccgagggtg 42780 

gtttgttgcg ttcggcgcag tcggagaacc 42840 

gaaccgccga atcatggcgt gcgctcgcgg 42900 

cgttgcgcgc cggtgaagtc cgggtgcctc 42960 

gccggatccc agtgcccggt gcggatggga 43020 

tgggcgggtt ggttgcccgg catttggtgg 43080 

cggggcgacg cggctggagc gcccccgggg 43140 

tgggagctgc ggtcgaggtg gcgagctgcg 43200 

tgctgacgac gatctcggca gagttcccgc 43260 

ttgccgacgg ggtcgtcgag tcgctgacac 43320 

aggccgccgg tgcgtggcac ctgcacgagt 43380 

tgctcttctc ctcgttctcc ggcgtggcgg 43440 

cgaacgcgtt cctggacggc ctggctcagc 43500 

cgctggcttg gggcttgtgg gagcagccca 43560 

gccgtagccg cattgcgcgc accaatccgc 43620 

tcgagatggc gtttcgcgtt ccgggcgaat 43680 

acgccctgcg cgctgatgcg gccgacggcg 43740 

cagcgcccgt gcggcggagc gcggtcaacg 43800 

ggctgcggag gctgccggac ctggatcagg 43860 

atgtttcggc ggtgctgggg cattcgggtg 43920 

gggatttggg ttttgattcg ttgtccggtg 43980 

tgggcgttcg gttgccggct actgcggtgt 44040 

ggttcttgct cgacaaactg attggtggcg 44100 

tggcggcggt gactgctgac gatcccgttg 44160 

gtggggtgtc ctcgccggag gagctttggc 44220 

cggagttccc ggacgatcgt ggctgggatc 44280 

gtcttgggac ctcgtatgtg tgtgagggtg 44340 

ccggtttctt cgggatttcc ccgcgtgagg 44400 

tgctggaagt cgcttgggaa accgtggagc 44460 

ggagccggac cggcgtgttc gcggggctga 44520 

cgagggcgcc ggagggtttc gagggttatc 44580 

cgggtcgggt tgcgtattcg tttggtttcg 44640 

gttcgtcgtc gttggtggcg ctgcacctgg 44700 

atctggctct tgcgggtggt gtgacggtga 44760 

cgcgtcaacg gggcttggcg gcggatgggc 44820 

gcaccggttg gggagaaggc gcgggcttgg 44880 
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tgttgttgga gcggctgtcg gatgcccggc 
ggggtagcgc ggtgaatcag gatggtgcgt 
cgcagcagcg ggtgatcacg caggcgttgg 
acgccgtgga ggcgcatggg actggaacca 
tgattgccac ttacgggcag gggcgggata 
agtcgaatat tggtcatacg caggcggcgg 
tggcgatgcg gcacgggcag ctgcccgcga 
tggattggtc ggcgggggat gtccagctcc 
gccatcctcg gcgggtgggc gtgtcgtcgt 
tcctcgaaca agcctcgaaa acaccagacg 
aatcgaccgt ggaccttcca gcggtcccgt 
tcagcgctca ggcgagcgca ttgttgtcct 
tggatgcggc gttttcgttg gcttcctccc 
tgggagcgga ccgcgaaacg ttgttgtccg 
cttctggggt ggtgtcggga tccccggtct 
agggcggaca gtggttgggg atgggccggg 
acgcgtttga cgaagcatgt gccggactgg 
gggatgtggt gtttggttcc gacgggtcct 
gtttgttcgc gttgcaggtt ggtttgctga 
gtgtggtgct gggccattcg gtcggcgagt 
cgttgccgga tgcggctcgg atggtggcgg 
ctggcggtgc catgttggcg gtggctgctg 
atcgggttga tggtgcgggt atcgccgcgg 
gcgatcggga ggtgcttgac gacatcgccg 
ggcggttgcg ggtttcgcat gcgtttcatt 
tcgccgaaat cgcacgcagc gtggactacc 
tgacgggtga gctcgacacc gcaggtgtga 
ttcgagagcc cgtccgcttc gccgacggcg 
cgatcttcga actcggccct gatgcgacgc 
gggctgatca ggccatgccg attccgatgc 
tggtcgccgc ggtggcgcgg gcgcacacgc 
tcgccggcac cggggcacgg cgggtcgagt 
actggctgga aacatcggat tacggcgatg 
atccgttgct gggggccgtg gttgcgctgg 
ggttgtcggt ggggacgcat ccgtggctgg 
tccccggcac cgccatcctg gagatggccc 
gggtggaaga gctcaccctg gaaacaccgc 
gtagtcgtgg ccctgcggga gggaccacag 
ggacgaacga cgccatcgaa atccagctgc 
gaagggtgtc gctgtattcc cgcccggccg 
acgccaccgg cgaactcgtc gtcggcacca 
ctgagggtgc cgagtcgatt gctctcgatg 
tcgagtacgg gccgttgttc caggggcttc 
tcgccgaaat cgccccgccg gccgaggccg 
cagcgttgct ggatgccgcg ctgcaggcgt 
gcgcgtggct gccgttctca ttcaccggcg 
gcagggtgcg gctggagacc aggcgacccg 
gtgggcggtt gctcgcctcg atcgattctc 
tggcgaatcg ggacgctgtc cgcgacgcgc 
cgcagtcgac ggaaccgggt cgctgggccc 
atctcatcaa actcgcaacg gattccgccg 
agaaacttga ttccagcgcg ctggttcctg 
cggatcccgg caccggcgca gccgcacttg 
aagcgtggtt ggctgagccg cggttggccg 
cggtgacgac ggctccgagt gacggtgcat 
tgttgcgtgc cgcgcaggtg gagaacccgg 
ccgccgaatc gtggcgtgcg ttgccgagtg 
tgcggaaggg cgcggtgcga gcgccccgct 
ccgcggttgt ggcggatccc gaccgaaccg 
ggggcgcggt tgcccgccac ctggtgaccg 
gccgtcgtgg ctgggatgct cctggaatca 
gtgccgtggt cgacgtggtg gcgtgcgacg 
tggcggcggt cccggcggaa tttccgttgt 
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gcaacgggca cgcggttctg gcggtcgtgc 44940 
cgaatggttt gacggcgccg aatgggccct 45000 
cgagtgctgg tttgtcggtg tctgatgtgg 45060 
ggcttggtga tccgattgag gcgcaggctc 45120 
gcgatcggcc gttgtggttg gggtcggtga 45180 
cgggtgtcgc tggtgtgatc aagatggtga 45240 
cgttgcatgt ggatgaacct acgtcggaag 45300 
tcacggagaa caccccctgg cccggcaaca 45360 
tcgggatcag cggcaccaac gcacacgtca 45420 
agactgcgga caagagcggt cccgattcgg 45480 
tgatcgtgtc ggggagaaca ccggcagcgc 45540 
atttgggtga gcgtggcgat atttccacgc 45600 
gggccgcgtt ggaggagcgg gcggtggtgc 45660 
ggttggaagc gctggcttcc ggtcgcgagg 45720 
ctggcggggt tgggttcgtg ttcgccggtc 45780 
ggctctactc ggtttttccg gtgttcgctg 45840 
acgcgcatct ggggcaggac gtgggggtcc 45900 
tgttggatcg gacgctgtgg gcccagtcgg 45960 
gcctgctggg ttcgtggggt gtccggccgg 46020 
tcgcggcggc ggttgcggcg ggagtgttgt 46080 
gtcgtgcccg gttgatgcag gcgttgcctt 46140 
gtgaggagca gctgcggccg ttgttggccg 46200 
tcaacgctcc tgagtcggtg gtgctctccg 46260 
gcgcgctgga tgggcaaggg attcggtggc 46320 
cgtatcggat ggacccgatg ttgcaggagt 46380 
ggcgtggcga cctaccggtc gtgtcgacgt 46440 
tggctacgcc ggagtattgg gtgcgtcagg 46500 
tccgggtgct cgcgcagcaa ggggtcgcca 46560 
tgtcggccct gattcccgat tgtcattcgt 46620 
tgcgtaaaga ccgtacggaa accgaaactg 46680 
gtggtgttcc ggtcgaatgg tcggcgtatt 46740 
tgccgacgta tgccttccag cggcagcggt 46800 
tgacgggtat cggcctggct gcggcggagc 46860 
ccgatggtga tgggatggtg ctgaccggcc 46920 
cccagcatcg cgtgctgggc gaggtcgtcg 46980 
tgcacgcagg ggcgcgtctc ggctgtgacc 47040 
tggtggtccc cgaacgcgcg gcgggtgccg 47100 
tttcaattga aactgcggaa gaacgtgtgc 47160 
tggtgaacgc acccgacgaa ggcggtcggc 47220 
gtgggtcgag aggtgggggt tggacgcgcc 47280 
ccggtggtag ggcggttcct gattggtcgg 47340 
agttctacgt cgctctggcc ggaaacgggt 47400 
aggcggcatg gcgtcgtggt gacgaggttc 47460 
atgcgatggc gtcgggatac ctgctcgacc 47520 
ccgcgctcgg cgaccgcccg gagcaaggcg 47580 
tcgaactttc cgctccggca gggacgatca 47640 
acgcgatatc ggtggccgtg atggatgaga 47700 
tcaggctacg aagcgtgtcg tcgggacagc 47760 
tgttcgaggt gacctgggag ccggtggcga 47820 
tgcttggtga tactgcctgc ggtaaagacg 47880 
accgctgcgc ggatctggcg gcgctagccg 47940 
atgtcgtggt ctactgcgcc ggagaacagg 48000 
cggagaccca gcagacgttg gctctgctcc 48060 
aggcacgtct ggtggtggtg acgtgtgcag 48120 
cagagctggc acatgcgccg ttgtgggggt 48180 
ggcagtttgt gctggcggac gtcgacggaa 48240 
cgttgggctc gatggaaccg cagttggccc 48300 
tggcttcggt cgccgggcag atcgacgtgc 48360 
tgctgatttc gggcggcacg ggcctgttgg 48420 
aacgcggtgt ccgccgattg gtgttgacgg 48480 
ccgagttggt gggtgagctg aacggcctcg 48540 
tcgcggatcg tgctgatctg gagtcgttgc 48600 
gcggcgtggt gcatgccgcg ggggcgctgg 48660 
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ccgacggggt gatcgagtcg ttgtcaccgg 
cggcgggggc gtggaatctg cacgagctga 
tgttctcctc gctttccggt gttgccggcg 
acgcgttcct ggacgcattg gcgcattacc 
tggcctgggg cctgtgggag cagccgagcg 
ggagcaggat cgcgcgcgcc aacccgccgt 
atgccgggct ggcgctggac cgggcagcgg 
tggccgagca ggcgcggtcg ggctcgctgc 
tccgtcgtaa taggcgggct agcggaaccg 
tggtgcggga gcatgccgcg gccgtgctgg 
agcgcgcttt ccgggatctg ggttttgatt 
ttgccggggt gctgggggtg cggttgccgg 
gggcgctggc ccggttcctg caccaggaac 
cggtgacgac gaccagggca ccggtcgccg 
gatgccgttt tcccggtcag gtgtcctcgc 
gcgtggatgc ggtcgcggac ttcccagccg 
atccggaccc ggaacgggct gggaagacct 
ccgatcggtt cgatgcgggt ttcttcggga 
cgcagcaacg gctgttgctg gagctgtcct 
cgggttcgct gagggggagt cggaccggtg 
gcgcccggtt cgccagccga gccccggaag 
ctgggagtgt cgcgtcgggc cggattgcgt 
cggtggatac tgcgtgttcg 



<210> 6 
<2U> 29736 
<212> DNA 

<213> Saccharopolyspora spinosa 
<400> 6 

tcgtcgttgg tggcgttgca tttggcgggt 
gcccttgccg gtggtgtgac ggtgatgtcg 
cagcggggcc tggcaccgga cgggcggtgc 
ggttggggtg agggtgctgg tttggtgttg 
gggcatcggg tgttggcggt ggttcgtggg 
ggcttgaccg cgccgaatgg tccctcgcag 
gcgggtctgt cggtgtccga tgtggatgcc 
ggtgatccga ttgaggcgca ggctctgatt 
cggccgttgt ggttggggtc ggtgaagtcc 
gttgccggtg tgatcaagat ggtgatggcg 
cacgtggatg caccctcctc gcaggtggat 
gagaacacgc cctggcccga cagtggtcgc 
atcagcggca ccaacgcgca cgtcatcctg 
gcggagccgg attcgagtcc tgttctggat 
aaaacacccg aagcgctatc cgcccaggcg 
gttgatgtct cccctctgga cgttgggatt 
gagcgggcgg tggtgctggg gtcggatcgt 
gctgccggtc atgaggctgc tggcgtggtt 
gggtttgtgt tcgccggtca aggcggtcag 
gagtttccgg cgttcgccgg tgctttcgac 
gggagggaag tcggggttcg ggatgtggtg 
actttgtggg cgcagtcggg tttgttcgcg 
acgtggggtg ttcggcccag cgtagtgctg 
ttcgccgcag gtgtgctgtc gatggcggag 
ttgatgcagg cgttgccttc tggcggtgcc 
gtcggcccgc tgctcgatgg ggtgcgggat 
QQQtcggtgg tgctttccgg tgaccgggat 
gggcaaggta tccggtcgag gtggttgcgg 
gatccgatgc tggcggagtt cgccgagctc 
ctgccgattg tctcgacgct gaccggaaac 
gagtattggg tgcgccaggt gcgagagccc 
gtggaccaag gcgtcgacac gattgtggaa 
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acgacgtggg agcggtgttc ggcccgaagg 48720 
ctcgtgatac ggacctgtcg ttcttcgcgt 48780 
ctcctggtca gggcaattat gcggcggcga 48840 
ggcggtcaca gggactgcct gcggtgtcgc 48900 
ggatgacgga gacgctcagc gaggtcgacc 48960 
tgtccaccaa ggagggattg cggctgttcg 49020 
tagttccggc gaagttggac aggactttcc 49080 
ccgcattgtt gacggcactg gtacccccca 49140 
agctcgcgga cgagggcacc ctgctcgggg 49200 
ggtattcgag cgcggctgac gtcggggtcg 49260 
cgttgtctgg tgtggagttg cggaaccgcc 49320 
cgactgcggt gttcgactat ccgacgccga 49380 
tggcagacga gatcgctacg acgccagcgc 49440 
aagacgatct cgtcgcgata gtcgggatgg 49500 
cggaggagct ctggcgtttg gtggccgggg 49560 
atcgcggctg ggatctggca ggcttgttcg 49620 
acgtgcggga aggggccttc ctcaccgacg 49680 
tttccccgcg tgaggcgttg gcgatggatc 49740 
gggaggccat tgaacgggca gggatcgatc 49800 
tgttcgcggg gctgatgtac cacgactatg 49860 
gtttcgaggg gtatctcggc aatggcagtg 49920 
actcgtttgg tttcgagggt cctgcggtga 49980 

50000 



cagtcgttgc gttccggcga atgcgatctc 60 
acgcccggga cgtttgtgga attctcccgt 120 
aagtcgttcg cggagagcgc ggacggtacc 180 
ttggagcggt tgtcggatgc tcggcggaat 240 
tcggcggtga atcaggatgg tgcgtcgaat 300 
cagcgggtca tccagcaggc gttggcgagt 360 
gtggaggcgc atgggaccgg gaccaggttg 420 
gctacgtatg ggcgcgatcg tgatcccggt 480 
aacatcggtc atacgcaggc ggcggcgggt 540 
atgcggcacg ggcaacttcc gcgcacgctg 600 
tggtcggcgg ggagggtcca gctcctgacg 660 
ccctgtcggg tgggggtgtc gtcgttcggg 720 
gaacagtcca cggggcagat ggatcaggca 780 
gttccggtgg tgccgtgggt ggtgtcgggc 840 
gcaacgttgg cgacctattt ggaccaaaat 900 
tcgcttgcgg tgacccgttc ggcgctggat 960 
gacacgttgt tgtctggcct gaatgcgctg 1020 
acgggacctg tcgggattgg tggccggacc 1080 
tggttgggga tgggccgccg gttgtactcg 1140 
gaagcatgcg ccgagctcga tgcgaacctg 1200 
ttcggctccg acgagtcctt gctggatcgg 1260 
ttgcaggtcg gtctctggga attgttgggt 1320 
gggcattcgg tcggggagct agccgcggcg 1380 
gcggctcggc tggtggcggg tcgtgcgcgg 1440 
atgctggcgg tgtccgcgac cgaggcccga 1500 
cgtgttggtg tcgcagcggt taacgctccg 1560 
gtgctcgatg gcattgccgg tcggctggac 1620 
gtttcgcacg cgtttcattc gcatcggatg 1680 
gcacggagcg tggactaccg gtctccacgg 1740 
ctcgatgacg tgggcgtgat ggctacgccg 1800 
gtccgcttcg ccgacggtgt ccaggcgctt 1860 
ctcggtccgg acggggcgtt gtcgagcttg 1920 
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gttcaagagt gtgtggcgga gtccgggcgg 
cgtgatgagg tccgaacggt gctggacgct 
gtggactggg ggtcattttt cgctggtacg 
gccttccaac gacagcggta ctggctggag 
ggcctgaccg gggcggagca tccgctgttg 
gaggtgctgc tgaccggcag gctgtcggtg 
gtgctgggcg aagtcgtcgt ccccggcacc 
agccaggtcg gttgtgaacg tgtggaggag 
gagcggggcg ctgcggcggt gcagttggcg 
agtttgcagc tctattcccg aggcgctgat 
gggctgttgg cccaggccaa tgcggtgccg 
ggcgccgggc aggtcgatct ggcggagttc 
tacggtccgg tattccaagg gctccgcgcc 
gaattggccg ggtcaccaga cgcctcgggt 
gcactgcacg cgatggcgct tggtgcttcg 
tggcgtggcg cccagctgta ccgcgctgaa 
ctgggctccg gtgcagtctc attgacgttg 
gtggaatcgc tttcgacgcg accggtctcc 
caagagcggc tgctgcacgt cgagtgggta 
acctcctgcg cggtggtcgg tttgggcgaa 
ggtgtccaag tcgagtccca tgcggacctt 
ggttcagctc ctggtgcggt catcgtcccg 
ccgaccgccg cgcgaagggc gacgcaacag 
gatgaccggt tcgtcagtac gcgcctgatc 
gctggagaag acgtgctcga cctggtacac 
caagcggagc acccggaccg attcgccttg 
acggcactcg ccgaagcgct gactgcggga 
gtgctggcgc cccgcctcgg ccaggtgaag 
gaaggcaccg tgctggtcac cggcggaacc 
ctggtcagcg cccacggtgt gcggcacctg 
cccggagcgg atgagctggt ggccgagctg 
gcgtgcgact cggcagatcg ggactcgctt 
aacccgttgc gggtggtggt gcacgccgcc 
atgtcgccgg agcgcttgga cgcggtgttg 
cacgagctga ctcgggaact cggtctgtcg 
ctgttcggcg gtgcggggca gagcaattac 
gcgcattgcc ggcaggccca ggggctgccc 
agtatcgatg gaatggcggg cgacctcgct 
ggcattggcc cgctttcggc accgggaggg 
gacgaaccgt tgctggcacc ggtgcgactg 
tccgtgcaga cccggattcc ggaaatgctg 
actccgttca cttccagggt tgagccgttg 
gaacgtcggc agcaagtgct ccagcgcgtc 
ggcaggtcga gcgatgtgga catcgagaag 
acggccatcg aactccgcaa ccgtctcgct 
ctggccttcg accacggcac tgcggcggca 
accgcgaccg cgccggcacc gaggcgaacc 
tcgctcttcc aacaggcgta tgcggctggc 
gtcgctgccc agttgcgacc ggtgttcggt 
cccgtccagc tttcccgtgg tcccgaagag 
gggatgccgc ccgcacagca gtacgcgcgg 
gtttcggtga tcccgatgcc tggattcatt 
gtggcggttc ggacgcaggc ggaggcggtg 
ctggtcgggc attcctccgg gggctggctg 
cgcggggtcg tcccggccgg ggtcgtactg 
ccgaggttct ccgtggcgat ggcccaccgg 
atgcaggatg tcggtatcac cgcgatgggc 
ccgacgccga tcggtgctcc gacgctgttc 
gaagggcggc cgtggacaga tgactcctgg 
gtccaggtgc cgggcgacca cttctcgatg 
gcagtcgcga gttggcttga caaactcaac 
tccttttagg accttctggg cggcaccggc 
ggctcgccga tcttgacggc gcacgatgcg 
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gcgacgggga ttccgttggt gcggagagac 1980 
ttggcgcaga cccacactcg tggtggcgcg 2040 
agggcaacgc aagtcgacct tcccacgtat 2100 
ccatcggatt ccggtgatgt gaccggtgtt 2160 
ggtgccgtgg tgccggtcgc gggcggcgat 2220 
gggacgcatc cgtggctggc ggaacaccgc 2280 
gcgttgctgg agatggcgtg gcgggccggt 2340 
ctcaccttgg aggcaccgct ggtcctgccg 2400 
gtgggggctc cggatgaggc cggccggcgc 2460 
gaagacggcg actggcggcg gattgcctcc 2520 
ccggcggatt cgacggcatg gccgccggac 2580 
tacgagcgcc tcgccgagcg cggcttgacc 2640 
gcatggcggc acggcgacga tatcttcgcc 2700 
ttcggcatcc acccggcgct gctggacgct 2760 
cccgactcgg aagcgcgtct gccgttttcc 2820 
ggagcagcgc ttcgggtacg gctctcgccg 2880 
gtggatgcca cagggcgacg agtcgctgcg 2940 
accgaccaga tcggtgccgg tcgcggcgat 3000 
aggtcggctg aatctgcggg gatgtctctg 3060 
ccggagtggc acgctgcgct gaagaccact 3120 
gcttcgttgg ccaccgaggt tgccaagcgg 3180 
tgcccgcgac cccgagcgat gcaggagctg 3240 
gcgatggcga tgctgcagca atggcttgcc 3300 
ctgctgacgc atcgggcggt ctccgcagtt 3360 
gcgccgctgt ggggcttggt ccgcagcgcg 3420 
atcgatatgg acgacgagcg agcatcgcag 3480 
gaagcgcagc tcgcggtgcg gtcgggagtt 3540 
gtgagtggag gtgaagcgtt caggtgggat 3600 
ggcgggctcg gggccctgct cgcacgccat 3660 
ttgctcgcaa gtcgccgtgg tctggcggcg 3720 
gagcaggccg gcgccgacgt cgcggtcgtc 3780 
gcgcggctgg tggcgtcggt gcctgcggaa 3840 
ggtgtgctgg atgacggtgt gctgatgtcg 3900 
cggcccaaag tggatgccgc gtggtacctg 3960 
gcgttcgtgt tgttctcctc ggtcgcgggc 4020 
gctgccggca acgctttcct ggatgccttg 4080 
gcgctgtcgc tggcctccgg gctgtgggcg 4140 
gcggcagatg tggagcggct gtcgcgggca 4200 
ctggccttgt tcgacgctgc cgttggctcg 4260 
gatgtcgaag cactgcgtgt gcaggcccga 4320 
catggcatgg caatggggcc aagccgccgc 4380 
cacgaacggc tggccggatt gtcggagggc 4440 
cgcgccgata tcgcggtggt actggggcac 4500 
cctttggccg agctgggttt cgactcgctg 4560 
accgccaccg gactgcggct tcccgcgacg 4620 
ctcgcccagc acgtgtgcgc gcagctaggc 4680 
gacgacaacg acgccacgga gcccgtgagg 4740 
cggatacttg acgggatgga tttggtgaag 4800 
tcgcctggcg agctggaatc cctgccgaaa 4860 
cttgccttgg tgtgcatgcc ggcgctgatc 4920 
atcgccgccg ggttccgcga tgtgcgggac 4980 
gcgggagaac cgctgccgtc cgccatcgag 5040 
ctgcaggaat tcgccggggg ctcgttcgta 5100 
gcgcacgagg tagccggtga gctggagcgt 5160 
ctggacacct acatccccgg tgagatcacg 5220 
acgtatgaga agctcgcgac tttcacggac 5280 
gggtacttcc ggatgttcac cgagtggact 5340 
gtgcggaccg aagattgcgt cgcagaccct 5400 
cggccagggt ggactctcgc ggatgccacg 5460 
atggacgagc acgccgggtc caccgcacag 5520 
cagcgcaccg ctcggcaacg ctgacgggcg 5580 
caccccggcg gtgccgcctt ccgtggtcca 5640 
cggcacgcgc gctgatcgtg attccgctgc 5700 
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cgctcgtggc catcggcctg gcgaatcatg 
cgagcccgca ttccgaggtg aggggcaccc 
tcacctcgaa attcgtccga tttgggcggt 
ttcttgttga cagagcggtg agaagccgct 
tgctcaagcg ccaccgtttg acgaccgcca 
tcagcggctg cggaaccgcc gccgcacttc 
gcaatgttgg tatggcctcg ggcgggggcg 
aggccgattt cctcgccacc gcgacaccgg 
cgtacgggaa ccggtcggac aagacctgca 
gcgtggacat gagcaactcg ccgatcgagg 
acgcgcccaa ggaattcacc ctccagcccg 
tcctggccga cagcggcgac ccgaacgccc 
cggacatgtc cgaggcccag ccggtcaacg 
cgaagtacct gcgagtcagc tcgctggtgt 
gtgagtcccg cagattcgac ctcgccgtgc 
tcgttgtgac cagcgttgtt cacgtccggg 
ggcgcctcca acggggcctg gcatccgggg 
ctgtcgggcg ttgtcacgcc gccggcctcg 
ccagcgcggt gtggcggcgg ccggagccga 
tcgaaccgac gaggaccacg accttcttgg 
gctgcagcat catccaggcg cttgtggtca 
cctgcgaacg atggatggtc accgcgtagg 
cgtagtcgat gtcctcgtcc tcgtcggttc 
gggcggacac gacgccctgc gtgccgttga 
tgcggatctg cgtgaccttg tcgccgacgc 
gcaggccctc cctggccggg gtgatcgctt 
ctgcggggcc tcgatgcatc ggggcgagga 
acttccgcgg aatccggcgg gcgacgacgt 
cctctacgtg gaacaggaag aagtcggtca 
tgattcggtg cgcgttggtc accaccccgg 
gccgcacgtg tggaatcggg gtgccagggg 
cgaccgacgg gagctggtcg acgtcgccga 
ccttggccag tttgttggct aacagcaggt 
ggtcggcgtc cagcgggttg tcccggtcgt 
gcaggcggtg cacggtcgcc gcgtcgtgtc 
gtcccgtcgg cgcggcgagg atcaccttgg 
accgcacggt gaagctcttg ccgcagcctg 
tcagggccag cttgacggcg cgctcctgcg 
gcaaccagtc gagggccttg tgccaatcga 
tggtgttcag cagccgggac agctggttgg 
cgaggtagat cgcgaccgtc ggcacctcgt 
caccttcctc ggtgacgagt tcggcgaggc 
ggatcttcac cgcctcggcg atcagctcgt 
tggactccga cagcgtgaac tgaaggcccg 
ttcccaccgc tttggcgatg gtgtcggcgg 
ccagccggta tggctcttcc ttgacggtcc 
tcttcaccgc cagcgaggtc gagacgccga 
tcgccttctg ctcctcccac gcgtcggcga 
ggacctcgat cagccgcgcg ggttcctgct 
ggtcgacgat cttctcggcg agtttggggc 
agcggcggat accttgcacg gtcgcaggca 
gccgcccgta ctgggggtgc gacccccacc 
gcgcgcccag cagcgcgccg acgaccgtca 
gcgcgacggt gtagccgctc tcctcgttgg 
gcaccgcagt ccacgtggcc gactcccgtc 
ggctgtcaag caaaccggcg gtcaccacat 
cggcgtcagc gggtgggcgt tcgcgatgcc 
ttcgtcctcg cgcagggcct ggagtaggtg 
ggataaaccg gcttgacgga ggagatggaa 
gacgaccccg agtcatgcgt cgacgacctg 
acgtgggggc gtgcgatcag gtcgcggcgc 
agccgcgacg gactggttcc gctcctcgag 
cgatgccggc cgcgttcgcg aggtggccag 
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tcctttcggg caacgtcaaa cgaattcgtc 5760 
ttgggtggct gagccgctca agggtgcccc 5820 
ggacgcaacc ccggtgggcg tggtgcgtct 5880 
gacacacctg agaggaaaag gggagcatga 5940 
tcaccggcct tctgggggga gtactgctgg 6000 
agtcctcgcc ggcgcccggg catgacgcgc 6060 
gcggggacat cggcacgtcg aactgctcgg 6120 
tgaaaggcga ccccggcagt ttcatcgtgg 6180 
cgatcaacgg cggcgtgccg aacctcaagg 6240 
acctgccggt cgaggacgtg cggcttcccg 6300 
gtcagagcgc gtacgccggc attggcatgg 6360 
atgtcctcac cgggttccag tcctcgctgc 6420 
ttctcggcga cggcaacgtg aagttcgccg 6480 
ctaccgcaga cgagctgcgc taaaacccat 6540 
ggcgcctccg gcgaagcgtc cgtacgtttg 6600 
cgcagcgctg gtacatactc aggcgtctcg 6660 
ccgtcgagtg cggcggcgct gacgcgttct 6720 
aaccggtccc gccccgtcgg agccggtggt 6780 
cggtgcgcac cgcctgcccg agggcctttt 6840 
cccgggtgac cgccgtgtag agcaggttgc 6900 
aggggatcac cacgcacggg tattcgcttc 6960 
cgtggaccag ttcgtcgagt tctgtgaagt 7020 
gcacggtcat ggtctgtgct tcgttgtcga 7080 
acacgccgtt ggcgcccttg tcgtagttgt 7140 
ggaagatccg tccgccgaac cgccgctctg 7200 
cctgcaacag ctggttcagc gcgcctgcac 7260 
cctgcacgtc ggtgcgcggg ttgaaccgga 7320 
cgacggtgag ctcggcggtc ggttcgcttt 7380 
gcccgtgtgt cagcggatag tccccggcgt 7440 
actcggcggc ctgccggaac acctcgttga 7500 
cgagcagatc gcgcagtacc tcaccggctc 7560 
ccagcagcag gtgcgcgccg ggcgcgatcg 7620 
cgagcatgga cgcctcgtcg accacgacga 7680 
aggcggcgtc cccgcccggc tggagttgga 7740 
cggtgagctc ggtcagccgc ttcgccgctc 7800 
cctttttcgc ctgagctaat gcgatgatcg 7860 
gacctccggt gagcacggcg accttctcgg 7920 
cctcggcgag ttcggcaccg gtagcgcggc 7980 
cgtcggcgaa gacgggcatc cggtccgcgc 8040 
ccagggcgac ttcggcgcgg tggaagggca 8100 
cgtcatcggt ggggatctcc tcgcggacca 8160 
attcgatcac cagcccggtg tcgacggcga 8220 
tctccggcag gtagcagttg ccgtcgccgg 8280 
cctttacccg ctgcggggag tcgtgcggga 8340 
tcttgaaacc gattccccac acgtcgcctg 8400 
ggatcgcgtc gtcgtggtac tgcttgtaga 8460 
cgccttgcag gaagatcatc acctccttga 8520 
tcagcttcgt ccgcttcggg ccgagcttgg 8580 
cgatgacgtc gagcgcggcg acgccgaagt 8640 
cgatgccctt gatcaggcca gaccccaggt 8700 
gcacggtcgt gtagtcgtcg acgtggaact 8760 
ggccgcgcat gcgcaacgcc tcgccgggct 8820 
ccaggtcacc gccccggccg gtgtcgatcc 8880 
cgaacgtgat ccgctccagc gtgccctcca 8940 
ctttttccac cgacaacacg tatcacgaac 9000 
gcagcggcat ctcccgaacg cctcgggctc 9060 
ttggtgcggc cggtgggagt tgtagatttt 9120 
ccgctggctc cagatcaggg gaaagcgccg 9180 
gagctctacg tcgaatggcc agcgacccac 9240 
cgaggccgtg gcgaagcgtt gacagggggc 9300 
cggctcagcg ccgtgtccgg acgcaccagc 9360 
cgcggtgtcc tcgatcgtct ttttcgagct 9420 
tccagtcggt cggcagtcgt ggtcgcagtc 9480 
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tgccttttga acatctggtc aattcagctt 
ccgccaattg tttgctcgcc gtgcgccctg 
gctggcgaag gtgtcgattg gctcggacgg 
cacgagcgtc ggcgatgccc ggtactatct 
cgggcgcggt gcgctcctga tcatgaatca 
tcgtggggca ctgtgaggat gtgcatcaca 
atcgcgggct gccgcggctg tgcccagtcg 
gtcagcccct gcgcgtcgag tgcgactttg 
tcggttgccg gtgcgttcgc ctgcaccgac 
tcggtgacga ccgagccatt cgcggtgcgg 
aactccgcct gctgcggtcc gttcgcgtcc 
caggcgtggt cgttgttgtt catcgactgg 
aacgtcagac cggtgcggcc cagcgaggtc 
tagaactgcg gcctgcgtag gaacaggaag 
aagtgcgcct gttggtatgg aagtttcgcg 
tagcgctcgc cgtagatcgc gtgggtgagg 
gcgggcaccg gtgccgggtc ctcggcccag 
ctggacagga acgcgcccaa gttcggcacc 
cgttccgggt ccggtcgcac gtcggcgtag 
acgttggcga accagccgga gccgtcgggt 
gtgtacttcc gcgccatcgt gacgaggttg 
atctccggca gcatcacctc gaagttgtag 
tagcccgaag gactctgccc gttcgcggca 
cgcagcgcgg ggtcggggtc ctggttcagc 
acctggttcg cgtagtggat tgtgggctgc 
cacgtcatcg ctttgcgcag cgccgaggtg 
gccgcgcgca gcagggcgtt ggtcttgctc 
agcccgtgct cggtcggcga atactccggc 
tagtgtccga gcccggcgtc gagcgcggcg 
cacgtgcggg attgcgtggc gaaccaggcg 
ttgtaggaca ccgccgggct gcgccaccag 
tcggcgacca tcggcgagac cgcggtgagg 
aacaggcgcc ggttcggtgt ggtgggcggc 
agtgcggcga gcagtccgag agcggcggcg 
gtcacgggcc tacctccttg tggccgatca 
gatgataatt cgaattatta tgggcttgac 
ccaattcgta ttggcaagcg ggggtgctcg 
cccggatggc aggagtcagc caggccacgg 
gcaacgtccg gatcgcggcg gagacccgtg 
gctacgtcgc gaacccgatc gcccgcagga 
tgttcaccta cgaggcggtg ttcccgagca 
aaggcatcga ggaacaggcc gaggaggtcg 
aggccaccgg ggagcggcgg cggattttcg 
gcagtctgct gctcggtcgc acggtcgacc 
gcatcccgta cgtctccatc ggacgacgcg 
gggccgacta ccgcaccgcg gtgcgagacc 
gcgggttcgc gtacgtgggg tctggtgggg 
gcttccgcga agccgttgcc gcacatggcg 
tcgatcagct gcgcgaagcg ggcgtcaccg 
ccgcgctcgt gctcgccggg cgcgaacgcg 
tctcgctcgg tgccgctacc cggtcggcac 
gcatccccag gcgcgagatg gggcgccggg 
acggcaccac accgcaagaa ctgctcccgt 
caccacgcct ttgaccagga ggaactgttg 
ggcggcggac tcggcggtgt cgccgccgca 
gtgctcaccg aggagtacga ctggctcggc 
gacgagcaca gctgggtgga gcgcttcggc 
ggcatccgcg actactaccg ccgccactac 
gagctcaacc ccggtgcggg caacgtgagc 
gcggtgatcg acgagatgct ggcgccgttc 
ccgtaccggc cggtggccgc gcacaacgac 
caccgcgaca ccggtgaaca gatcgagctc 
acgggtgaac tgcttccgtt gtccagcacg 
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tgaatgatcg ttcggcgagt attgtcctgc 9540 
ggacaggagc gcggcgttcg gggacggcgt 9600 
ccggaccagg atcgccgacg tcgtgatcgc 9660 
ggagcaccag cgagtggaca acgacttcca 9720 
agaactgaga ccgtcagggt gaacgtgccg 9780 
cggccggggt aggtgatcga ggaggccctg 9840 
aatcgcagca ccacctggcc gcgggtgagc 9900 
tcggagtccc tcaccaccaa cgggatctgc 9960 
ctggtgatcc ggtcgggccg cacggtcaca 10020 
tagcggaacc ggtatggctc gccgcccagg 10080 
ggcacctggc ccgggaagac cgtcgcccag 10140 
acgaccgtgc ccgcgaccgg gtgccacagg 10200 
gcgcgcctcg tgccgaagta gccgccgagg 10260 
tcttgcccct ggtcgctgcg gatctcggtg 10320 
atcgcggccg tgcgctgggc gcgggtcgga 10380 
atgcgcggtg aggtgtcctg tttgacgagt 10440 
gccgctcgtg cggcggcgag gtcttcgcgg 10500 
actggcacga actggttgtt cagcgcggta 10560 
aaccgcgccg aggtgcggga gttgggggcg 10620 
tcgcgcagca ggttgtagct gagccagtcg 10680 
gcgttgttgg actggcgcca cgcgtcggcc 10740 
ttggtgtccg cgccggtcaa ctcgtagaag 10800 
aggcgcgcga acgcctcggt caactgctgc 10860 
gcgagcgagg agccggcgag cccggcggta 10920 
caggcgctac cgttgcccgg gttgaggagc 10980 
atctgcgact ggcgttgcgg caggatgttc 11040 
aggtacccca acccgaagcc ggtggcggcg 11100 
cacgagccgt catcgtgctg caaccctagg 11160 
agcagagtcg cgtcgccccg gtacgggttc 11220 
agcgtgtaga cgtgctcctg gacgcgggcg 11280 
cctccggcga agaagccgct cgagtccatg 11340 
tacgacgcga atcgctgctc ctcgggtgcg 11400 
ggggcgggca gtgcgagtgc gcgggtcggc 11460 
ccggtcatca ggctgcgacg gctgaacgta 11520 
accctcaccc gctgcgtagc cgcacgtcaa 11580 
gacgcgtagg ccgacgacgc agaattcctg 11640 
tggcccgacg gctcacccag caggacatcg 11700 
tgtcgctggt gctcaacaac cggaaggacg 11760 
cgcaggtact ggaggtgatt cggaagaccg 11820 
tgcgcgatcg gcacaaccgc atcctcggcg 11880 
cccacgcgaa cttctaccag tcgttcctcg 11940 
gctgtgacct gttgttgttc accagcgcca 12000 
gcgacgacag ccgggtgcgg ctcgccgacg 12060 
gcgacgacct gacccagctg ctcgccgaaa 12120 
acgacgcggg cggtccggtg ccgcacgtcg 12180 
tggtggaccg cgcggtcgcg ctcggccacc 12240 
gcgcggagtc gtccgcggat cggctgcgag 12300 
tccaagggat gcatgtggag accccacagc 12360 
ctgtgctcac cgaagaggtg tcggacgggg 12420 
ggctctccgt gccgggcgac ctcgccgtgc 12480 
cggacgacga cgtggcgctc accggtttcc 12540 
cggtgcaggc gctgaccgag gtgctcgaaa 12600 
gcgagttcgt cgagggctcg acgctgggcg 12660 
ctcgaccaca ccacggacgt tgtcgtcgtt 12720 
ctcgcgttgc tgcgcgcggg ccgccgggtc 12780 
ggccagctga ccagccaggc cgtgccgccc 12840 
gtcaccgcga gctaccgggc gttacgcgac 12900 
ccgctgaccc cgcgcgcacg ggcgtggcgg 12960 
aggctctgcc acgagccccg cgtcgccgtc 13020 
cgcggcagtg gcaggctgac cgtgctgcag 13080 
ggcgaccgga tcgtgtcggt gaccgttgcg 13140 
tccgcgccgt acatcctgga cgcgacggag 13200 
gagtacgtca ccggcttcga gtccactctg 13260 
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gacaccggcg agccgagtgc gcccgacgtc gcgcagccgg cgaacatgca ggcggtgtcg 13320 
gtgtgcttcg tggtcgacca cgtcgacggc gaccacacca tcgacaaacc ggcgcggtac 13380 
gacttctggc gcgcgtacca gccggacttc tggggcgacc ggatgctgtc gttccgctcc 13440 
cccaacccgc gcacgctcgc gatctccgaa cgtacgttca ccccgaaccc ggacgacgac 13500 
ccgctcggcg tcgtgtcgga ccagcggctc agtgccggtg acagcaatct gtggacgttc 13560 
cggcgcatcg ccgcgcgtcg caacttcgtc gagggtgcct acgacagcga catctgcctg 13620 
gtgaactggc cgatcatcga ctacttcgag tcgccggtga tcgacgtgcc ggacgccgac 13680 
gcgcacatcg ccgcggcgcg ggaactctca cgttcggtgc tctactggct acagaccgag 13740 
gcgccgcgcc cagacggcgg caccggcttc cccggcctcc gcctgcgcgg cgacgtcacc 13800 
ggcagcgcgg acggtctcgc gcaggcgccg tacatccgcg agtccaggcg catcagggcc 13860 
gagcacacga tcgtcgaaca ggacctctcg ctcgccgtgc gcggcgacaa gggtgcggtg 13920 
cagcacgccg acgccgtggg tgtcggcatg taccgtatcg acctgcaccc ctccaccggt 13980 
ggcgacaact acatcgacgt cgcgagctgc ccgttcgaga tcccgctcgg cgcgctgatc 14040 
ccgcaacggg tggagaacct gctacccgcg ggcaagaaca tcggcaccac ccacatcacc 14100 
aacggttccc accggctgca cccagtcgag tggaacgtcg gcgaggtcgc gggcgcgctc 14160 
gctgccttct gcctggcgca ccgagtcacc cctcgcgcgg tgcgcaatac ccctggcctg 14220 
ctcgcggact tccagcagtg tctggaacgc gacggggtcg agctccgctg gccggacgtg 14280 
tccggctact gacgcaggga gacgaaaatg acaaagctgt cacgacgact cacggcactc 14340 
atcgtcgcag ggctgttcgc cctcaccggc tgcggtggtg gatcaaccgc acagtccgga 14400 
ccgaagtcgc tgcgcatgac cgtgtggact gccaacgcgg cgcatctcaa gctgctcaac 14460 
gacatcgccg ccgagtacaa ggcctcgcac ccggacatcg ccgagatcaa gttcgactcg 14520 
gtgcccgccg acggctacac caccacgctc accacccaga tcgccggcgg taacgcgccg 14580 
gacctggcct ggatcctgga ggagtcggca ccggacttcg tggcgtccgg tgcgctcgcc 14640 
ccggtgcgcg gcaagatcga gaaggccgac gagctcgtgc cgtccgcgac gaagctgtgg 14700 
gagaaggacg gcgaactgta cgcctacccg ttctccacct cgccgttcgg cttgttcgtc 14760 
aacaccgacc tggtgaaggg cgcctcggcg gactggacct gggaccaggc gatcgcggct 14820 
gcctctgcgt cggcggccgc ctccggcaag ggcggcctgg tactgccgga cttcaagtac 14880 
cagaactggg cagtgctgtc ctctatctgg cgcggctggg gagctgatgc gtggagcgcg 14940 
gacggtcgct cgtgcgggtt ctccagcagc gagatgaacg acgcgatgtc cttcttgcac 15000 
aaggccatct tcaccgacaa ggcgattccg ggccccggca cgacggtgga cttcttcgcc 15060 
ggcgacgcgg cgatggcgat cggccagatc tcccggtcca gtgcgttgaa ggacgcgaag 15120 
ttcggctgga cgctgctgcc gctgccggcc ggtccgaagg gtgactacgc ggtgatcggg 15180 
caggccggga tcggtgtgct gaagcagtcg cacaacgtcg acgccgcgac ggacttcctc 15240 
gccttcctca ccaaccagac caactccgcc aagctcgcgc agttcttccc cccggcgcgg 15300 
tcgtcgctgc tcaacgcgga gacgctcgcc aagagcaacc cgttgatcaa ggccgagcaa 15360 
ctgcagtccg tcgtcgtcga cggcatcaac aagggcgtcg tgaagccgag ccacaagggt 15420 
caggaggagc tgaaccagac gatccgcgcc gcgctcgacc cgttgtggaa gccggacgcg 15480 
aacgtgcaga acgtgctgaa cgacgtgtgc accaagatca aaccgctgct ggagaacaag 15540 
tgacggcggt cgcacactcc acgcgggccg tagcccgcaa ggggacgtcg tactggacgc 15600 
agcgcaggcg cgacaatctg gttggttacc tgttcgtcgc gcccgcgctg ctcggcagca 15660 
tcgcgttcgt gctggtgccg ctggccttgg tcggctggta cagcctcaac gagtggaacg 15720 
tgctcgccgg cacgttcgag ttcgtcggcg cgcagaacta ccaacagctg ctcgccgacg 15780 
agaagctgcg cgactcgctg gtggcgacca cttggttcgc cgccggcctg gtcgtgctca 15840 
acctgtcact agcgctgctg ctggccgtgc tgctcaacca gaagctgagc ggcaccacgg 15900 
tcttccgcac gttgttcttc tctcccgtcc tcgtgtcgct ggtggcgtgg acgctggtgt 15960 
ggcagctgat actgcagccg gagggcagcg tcaacggact gctcgggttc ttcggtgccg 16020 
acgggccgaa ctggctgcgc ggtgagtcga ccgcgatggt ctcggtgatc gtcgtgcagg 16080 
tgctcaagaa cgtcggcctg aacatggtgc tgttcctcgc ggcgctgcaa ggcgtgccgc 16140 
agccgttgta cgaggcggcg aagatcgacg gggcgggtgc ctggacccgg ttccgccgca 16200 
tcaccttgcc gttgatcagc ccgacgatcc tgctcacgtc gatcatcacc atcgtcggct 16260 
cgctgcaggt gttcgcgcag atcgcggtgc tcacgcaggg cggtccgggc acgtccacga 16320 
ccgtactgat ctactacctg taccagcagg cgttccagtt ccaccacttc ggctacggcg 16380 
cgacgatctc ggtactgctg ttcgtcatcg tcgccgcact caccctgctg cagtggcaga 16440 
tgcgccggaa gtgggtgctg catgaggctt agggtcaaga tcacgctata cgggctgctg 16500 
tgcctgctgt gcgtgccttt cgtgttccca acctggtgga tgatcacctc gtcgatgaag 16560 
cccatcagcg agatcttctc cacctcgccg ctgccgtcgg agtggacgtt ctccacctac 16620 
cggcaggtgt tcgagatgca gccgttcgcg cagcagtact ggaacagcct ctacatcgcg 16680 
gtgatcgtca cgacaggcac aatggcggtg gcggcgatgg ccggatacgc gttcgcgcgt 16740 
atccggttcc cgggccagaa cgtgctgttc gtggtcgtcc tgatcggcct gctcatcccg 16800 
agcgaggtca ccatcgtgcc gctgttcaag atgttccagt cgctcggcct gaccaacacg 16860 
cactggccgc taatcatcgt gccgatcttc ggggcgccct gtgtgctggc gatcttcatc 16920 
atgcgccagt tcttcatcgc cctgccgtcg gagctggagg aggccgcgcg gatggacggc 16980 
ctcggacgag cgggcatctt ctaccgggtg gccctgccgc tgtcccggcc cgcgctgggc 17040 
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gccgtcgcga tcttcacgtt cctgcactcg 
ctgtccacac cggacatgta cacgttgccg 
ggcggcccga tgtggaacgt gcagctcgcc 
gtcgtgttcg tgctggcgca gcggcagttc 
gggtgagtcc cgcgatcgac ccggagatcg 
tgacgccagc gggaatcgac gcggtccggg 
tgacgcgcgg tggcaccgtg acgttcgccg 
tgctgcgccc agcgggtgtg ccacgtcttc 
tcgcgggcac ccggcgcacc gacctgcacg 
tcgtgctggt gtcgccggag taccggctcg 
aggactgctt ccgggtgctg gagtggatgt 
cgggtacatc ggcgggcggc gggctggcgg 
gcggtccgcc gatcttgccg tcggcgcgca 
acggcgtgct accgcgccag tgcagcaggt 
gcagactgcg ttcgcacgcc tcgacggcgc 
tgctgaccag gaaaacctcg tcgcggcgcc 
cgcccccgct gccgtacatc tcggcggtgt 
gccgcagcgc ggcgacctcc gccgcgcgct 
cctgccccag caccggcaat tcctgcccgc 
cgtttcagtc agcgctgctg cggcagcacg 
ggcccggcgg ccaaccgccc accggccatc 
gacccgccgc tgccgacctc gatggccggt 
gcgacgtgca gcgagccctc gcggaacacc 
tgcttggaag cgttcgtcaa cgcctcctgg 
ggcgacaccc cgtgcgggtc gcggatctcc 
cgcgcccgca cgaccagctc cgggatcgtg 
agcccgagcg tcgcgcgcat ctcgtccagc 
gcggtctcct tcgcaccggg atcctcggtg 
attaaggtca cgtgatgtcc caccgcgtcg 
tccgcgcgcg ccgtctcggc cttcgcctgc 
cgcagactgt ccgccagctg ctcccgcaac 
cctgccgcca gcagcacgaa cgccacggtc 
atgatcatca ccggtgccac ggcgaccgcc 
tggatccgcg gctgcgcgcg ccccaacgcg 
aaaccaccgg ccaacgctgg catgcacagc 
ggcagcagcg ccgaagccac caaaccggac 
gccagcacag ctgccgccgg aatcccgacg 
cgcaccgaac tcaccggaag ccgcccgcca 
ggttctgcgc gccgagcttg ctcaacaccg 
tcagcccgag ccggaccccg atatcggcgt 
cctgccgctc ccgcccggac aacccctgca 
gactgcggaa cctgggcaac aaccgcgcag 
ccgccagatc gtgcaccgcc cgaaccagca 
cgtgcgcccc gaaccgcaag gcatcggcca 
ccgccggcac cggcggattc gccaacgtcg 
tgccgggcat ctgcacgtcg atgagggcga 
gcaactcatc cccgttgccc gcctcggcga 
acacccgcag gccacgccgc agcatcgcct 
ccccgccgtc gtcagcccgc acctctcggt 
ctagcggggc cgcgaaagcg cagcgcacac 
cgcccttggc cttccgctcc cgcacccgca 
cgaacgtctc ccggaacctg cgctccagga 
tggtgaacag cacgatcgtc ggcggccgcg 
gctgcttgcc gccccgcacc ggcggcgggt 
tcaaccggcc ggtggagatc cgggtgtccc 
ccagtttcgc caccgcgcga ccggtcttcg 
cccgcaccag ctcgcggtcg atctccttct 
ggtcccactt gttgtaggcg atcaccaatg 
cccgcaggtc ctgctcgctc aacggctcgg 
ccgcctcgat cgcggccttg gtgcgcagcg 
tcacgcgctt gcgcagcccg gcggtgtcga 
ccagcgagtc caccgggtcc accgtggtgc 
ccccggtcag cttgttcagc aggctcgact 
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tggaacctct acctggagcc gatcgtctat 17100 
caagcgctca cgcagttcgt cgacaactac 17160 
gccgcgacca tgaccgcagt gccgattctc 17220 
atcgagggac tcgcgcacac cggggttaag 17280 
ccgcgctggt agacctcgcc agaccaccgg 17340 
cgggaggacg cgtcgtcacc gatgccgagc 17400 
atgccgatgc cgatggcgtg cccgtgcttg 17460 
cggtgctgca cctgcacggc ggcgggatgg 17520 
tgctggccga gtgggtgtcg gagctgggtg 17580 
cacccgagca cccgcacccg gcgccgtcgc 17640 
cccgcaacgg cttcggcccg cctgtcgttg 17700 
ctgcggtgac gctgatggcc cgcgacctcg 17760 
actgctcgaa cgcgtccagc gtctccgcca 17820 
agacgtccag gtgatcggtg cccaaccgcc 17880 
cacgccgacc cgcgttgtgc ggatacacct 17940 
cggcgatcgc ggcaccgacg acctcctcgg 18000 
cgatcagccc caggccgagg tccaggccgc 18060 
gcccacgccg ctcccccatg ccccaggtgc 18120 
cacgcagttg aaggcctcgc atgcacactc 18180 
gccaccagct cgaagccacc gtcctcggac 18240 
tgcacccgct ccgacaggcc ggccagcccg 18300 
ccccgccggt cgggggcgcc gtttgtgacg 18360 
agccgcacct tgatgtcagc accgggtgcg 18420 
acgacccggt aagccgcgcg ctcaaccgca 18480 
gaggaaagct ccacatccac accggcggcg 18540 
ctcaacccgg cccgctgcct gggggcgccg 18600 
gcctgcttcg ccaacccgcg cagccgcccg 18660 
gtggccgcca acgccgccga ctcgaccgcg 18720 
tggatctccc gcgcgatccg ggcccgctcc 18780 
aactccgcct cggtggccgc ctccagcttc 18840 
gccaccagcg cccccaatgc cgtcggcgcg 18900 
aggatgatcg acccgacccc cagcgactgc 18960 
gcggtcagca acacccagcc gaccaacatc 19020 
tacaacgcca cgaccgtcgg cgcccaaccc 19080 
agcaccgcga tccgcggcca ccgccgcaac 19140 
accgtcgcat agccgaacgg gtcgggcgcg 19200 
accagcgcct caacgccgag ccgagaccag 19260 
cgcgctgcgc gatcaacgcg gcctgcaccc 19320 
tcgacacgta gctcttgacc gtcgcctcgg 19380 
tggaccgacc atccgccaac aactggagca 19440 
ccagctgcgt ctcgtgcgcg ttgtcggaca 19500 
tgatccgcgg atccagcacc gccccaccgg 19560 
ccgcgggctc cgcgtccttg agcaggaacc 19620 
cgtaatcgtc caggtcgaac gtcgtcagca 19680 
ccaagtcccg caacgccgtc agcccgtcct 19740 
cgtcgacgcg gtacgcgcgc accgcggcca 19800 
ccacccggat cgagccgtcg ccctccaaca 19860 
cgtcatcggc aagcacaacg cggatcggcc 19920 
cacctccctg ttccgagggt cgccagcaga 19980 
gccacccgag cgcactgtgt cagcggcgcc 20040 
ccgagatccg caccgggcta cccacgaacc 20100 
accgccggta cccggcctcc aagaacccgg 20160 
actgcgcctg ggtggcgaac aggatcttcg 20220 
tggccgcgac cagctcggaa agccaaccgt 20280 
acgaatccag cgccgtgcgc agcgtcgggg 20340 
ccgacacgtt gacccgctcg gcccaccgca 20400 
ccaactggtg gcggcggtcc tcgtcgacca 20460 
cccgaccggc ctcgacgacc atagtgatca 20520 
agccgtcgat gagcacgatc accacctcgg 20580 
acgcgtagta ctcggtgccg cttgcggtct 20640 
cgaaccgcca cacctggccg tccagctcca 20700 
ccgcgacgtc gtgcaccacc gaccgctctt 20760 
tgcccacgtt cggcttgccg accaatgcca 20820 
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cccgccgcgg cccaccggtc gcgccgaaga 
ggacggcgtc cagcaggtca ccggaaccgc 
cgcccaaccc cagcgaccac agcgaatgca 
tgttggcggc cagcagcacc ggacgcttgg 
cggtctcggt ggcaccgacc cgggcgtcca 
tggccagttc cgcctgcgcg gccaccgacg 
caccggtgtc gaccaccgtg aagcggcgcc 
cccgggtgac accaggggtg tcctgcacga 
ccagcgtcga cttgcccacg ttgggacggc 
tcggctcctc gccctcgacc gcttcgacct 
accacgtgcc gtccaggcct tccaccgact 
gtcttggctc tggtccgcga ccagccgatc 
gcgcaccgcc tcggtcgcca cgaccagccc 
cggctccccg aacaacacgt cgatcctcgg 
gcgcgtgccc cggcaggcca ccggcaggat 
cgcaccgtgc tgcgcattgg tcacatcgcc 
gacgagacca ccggcccgca gcacccgcac 
accgcggcgc accccgatct gcccgatgcg 
catctcctgc ttgatcagga acaccgcgtg 
gtccatcatc gaactgtggt tggccaccag 
cgcgccgtgg atccgcaccc ggtacggcag 
gcaccggtgc attccaccgg acgcgccctc 
cgctccgccg cgaccagcag gccacggcac 
tccaccacac ccaggtccgt ggtgtcgagc 
gccaccttcc gccccgagtc cagcgcatcg 
aggtcaccgg cccgtccctc ggagacgtcc 
gcggacgccg tcaggtagac cttcaaaccc 
cgtccctcga cgaccatgcc gcctggcgac 
tgggcgacca actgctcccg cacctcggcg 
acctccggcc cgcggatctc ccgccccacg 
tccggatcgg tgcccaccgt cagccgcgct 
tccgacgggt cgacctcggc gcgcagcacc 
gtgtcgaggt aggtagcccc caatgcgaac 
gtgcccgaag ggccgtcgag cgccaccaca 
tccttcccga gccggaaccg atcggcccga 
tggtccgctc ctgccgatcc caatcgcatc 
gaaagttcta cgaataaggt cttgggtgtg 
ggcacgcagc aggacttcac cacccgatcc 
gacggccgct tcgagctgat cgtctccgac 
gtcgccttga ccacctccga gaccagcacg 
gtcgcccagc tcaacgagca gcacagcgag 
gtcgccccgg gctcccccta cgccggccga 
acctccgtgt cgatcgtcgc ggtcgtccgg 
gacttccagt tctccgccgg cgacctggtg 
gcagccagcg aaatcctgga aaagggctga 
cctgaccggg acacgcgtca ccaacggcgg 
atacgcaggg cacacgcgag aggaaacacg 
ctgggtgcgg ttttcttcgg tctgggcatc 
tcaccgatcc cgctctacct gatcggcggg 
ctgcacggca tcgaaccgtt cacccacctc 
ttgctgttag gcctggaata ctccgcgggt 
ctggccggtg tcatagacat cgtgctgaac 
ctgggttggg gtccgctcgg cgcgttcaca 
ggcatcatcg cgaaggtgct cggagacctg 
atcctgtcga tcctggtctt cgaagacctg 
gccgtgctgg ccggggtcag cttcctcggc 
gtgatcaccc tggtgctggt ggtcgcgctg 
gacagccccg acccggaggt cttcctgctg 
gggatcgcct cggagctgca ggtctcggcg 
atctccgggt cgaccgcggc gaacgcgacc 
gccgcgctgt tcttcgtcgt gttcggcctc 
ctgccggtgg cgctgctgct ggcggtggcg 
ttcgccgccc ggatgcaggg tgtcggccgg 
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tctcgcgcgg cgtctccggg aacacctcca 20880 
gcccgtgcaa cgcgctgacc ggcatcggct 20940 
cgtcggagac gcctcgctgg tcgtcgacct 21000 
accgccgcag caccctggcc acggcttcct 21060 
ccacgagcag caccgcatcg gcggtgtgca 21120 
cctgcagtcc cttcgcgtcc ggctcccagc 21180 
cgttccacaa cgcgtcgtag gccacccgat 21240 
cggcttcccg acggcccagc agccggttca 21300 
cgaccacggc cagcaccggc tgcggcttcg 21360 
cgtcgaactc agcccactca gcctcgtcgg 21420 
cgtcggtcac gctcgtttcc ctctcgaata 21480 
cagctcggcg atcagctcga ccagttcggt 21540 
ggcgcggcct ttgcccgccg gcagcacgat 21600 
tagcagccta cggcccgggc catccgggcg 21660 
cagcgcctcc gacgtccgcg ccaaccacgc 21720 
atcaccccgg gtgccctccg ggaacacccc 21780 
cgccgccagc agcggggcgc ggtccggcgc 21840 
gcgcaggaac cagcccagcg caccgcggaa 21900 
gcgcgggatc attccgaaca gcagcggccc 21960 
taccaccggg cccgtgcgcg ggatccgctc 22020 
ctgcacgatc cggcgcgaaa tccactggcc 22080 
gggcagggtc tcgtcgctca tggcgtcgtc 22140 
tccacgtgct tgcgcaactg cgtgagcacc 22200 
tccaccgcgt cgtcggcctt gcgcagcggc 22260 
cggcgctgca cgtccgcgtg ggtccggtcc 22320 
tgcgccgtgc ggcgctgcgc gcgggcgtgc 22380 
gcatccgggg ccaccacggt gccgatgtcg 22440 
accagcgcct caccgatcag cttccgctgg 22500 
accgccgaca ccgccgacac cgcgccggtc 22560 
tccgagccgt ccaggaagat ctcagggccc 22620 
gcgcgcacca cgcctgccac tgcggccgga 22680 
gccagcgtca cggcgcggta catcgctccg 22740 
gccaacttgc gggacaccgt cgacttgccg 22800 
ccaaggagcc tggcgtgtgc cacgtgcatg 22860 
cacccggccg gaccaaccgt ccattctgcc 22920 
cagatccaat ccggactgcg ggacaaatac 22980 
gatgtaactg tgaccccctt gccagggctc 23040 
ggccaccgca tcggcgtgat cacctaccgg 23100 
cacgaagacc cagacaaggt ggccgcctcg 23160 
ctggccaacc tgctgggcgc cccgcagctg 23220 
gtcgccggga tcaccacctg gcagctgtcc 23280 
accctcggcg acacggagat gcgcacccgc 23340 
gacggcaccg tgcacccctc accgcggccg 23400 
gtggtcgtgg gcaccgcgaa ggggctacgc 23460 
tccggcccgt cccgaacact ttcccggcca 23520 
tccagcgcgt gccgatcccc taacaactcc 23580 
ttgcaagaca cggcgatctc cctaatagaa 23640 
ctcgggagac tcgcatggaa aatcggggtg 23700 
ctggccttcg gcaccggtgg cctggtcccc 23760 
gcctccgaga tcggcgtggt cctgctgttg 23820 
gaactcgtca ccggcttacg ccgctcctgg 23880 
gcggtacccg gtgcgctggt ggccctgctg 23940 
atggccggcg tcacctacat ctcttcctcc 24000 
ggtcggctcg gcaaccggga aacgccggtg 24060 
gcgatggccg tctacctgcc gatcctgacc 24120 
ggactgaccg cggtcggggt gtcgctggtc 24180 
cgcttcggca agtacgtgtc cgcgctggtc 24240 
cgcctgctgg gggcggcgct gctcgtcgcg 24300 
gcggtcggct cgttcctgct gggcatcgcg 24360 
cgcatgctgg agccgctgcg cgacctgttc 24420 
aacaccgatc cgagccagat cccgccggtg 24480 
acctgcgcga cgaaagtgct caccggctgg 24540 
atgggtcggc tgcgggccgg tgccgcgctg 24600 
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gtggcgcgcg gcgaattctc catcgtgatc 
ccgggcgagc tcgccgccct cgccaccgcc 
gtcgcggccc gcgtcgtcga gccggtggcc 
gcctgaccgg aaaagcctcc ggctgtcccg 
gcggataacg cgggcgggtc tggcaaacgc 
cactgcgcgg tagagcgagc cgatctcctg 
ctcgttggtc agccgcacat cgccgatccg 
gacgtgcttg agcagccgcc gcacgatgcg 
cagcgaccgg ttctggttca tgtcgaccag 
ctccagctcc acgccctcgc gcagctgctg 
ctccgccagg tacgtcttgc gcaccttgta 
gccgtcgttg gtgatcagca gcaggccctc 
cagcttgccc tcccgttccc gcaggtaatc 
catggtgcac aggatgccgc gcggcttgtt 
catcacccgg ttgccgtcca cgtggaccac 
ctcggtgacg acctcgccgt cgacctcgat 
gcgccgcgag gccaccccgg cctgggacaa 
ggcagtgctg ctgtacgggg aacggtgctc 
acaagggggc gagcggcggc aactccttca 
gctcggtcgt gcagtacagg atgccgcccg 
ggccgcgccc caccagggta cggatgacgc 
cccgcgaacg cgtcaccggc tgccggtacg 
gcgtcagctt cgaacgctgc ccgtccaaca 
cccgcgtgta gaaccgccag ccgtcgccga 
cggcgtagcc gtccgagagc cgccgcagcg 
gggtgtcggc cagcagctcc tcgccggccg 
gcgccgcatc cagcgccgaa tcggaggtca 
cggaatcccc agcggccgcc gacggctcca 
tcggcgactc ctccgcgacc tcgcccgcgg 
ctcttcttca tggctggcgc gctgcgcctc 
gcccacccag ctcacgatca gctcacccag 
ctcccggtag agctcgagca gcgccaggaa 
gtccgacacc agctccgaga acgtcgcgct 
cgccgcgtgc tcgcgcaccg aaacgccgtg 
cggtggcttc ggccggaaca ccgcggccgc 
gatcaccact tccggcagca ggttcgcgaa 
gcgccgcagc gccccggcct ccaactcgcc 
gtactgcagc agccgcgcga agagcaggtc 
gtcctcgacc tccgcggccg gcagcagccg 
gaccaccagg aactcggtgg tttcgttcag 
ggcgatgaac tcgtcggtga ccttgtgcag 
cgagatcagc tgcagcagca ggtcgaaggg 
ccggccgcct gcggcgggat cctcgctcga 
tctccgggct cctgcaggcg ccgcaccagc 
gcgagcacca cggcaaccgc ttcgcgcacc 
tcggcccgca atgccagccg cgcctgctcc 
acggtgatct tcgaatcgtg cttgcgccgt 
gcctgctccg ggtccggcgg attcgcgtga 
tcggccgctc cgggcagggc gacccgacgc 
cgcggtacgc ctgcgccccg gccgagcggg 
ccgtggtctc cggaaaccgc accgtgcggt 
acgcctcgac gacgcgcgcc atcacctcgc 
tcgccaggat gccgctgatc tccagcctcg 
tgtcgatcag cagcgccacc ccgcgcaggc 
cggcgtccgc ggcggccaac gcgttcaccg 
ccagcacgta gtcgtactcc gccagcgccg 
ggcccacctc cgcgaccagc tgcacctcgg 
ggtccatgcc ctcgacggtg gtctgccgca 
tgatcacgtt gtagatggtc tggtcgagct 
gcgcgccctg cggatcgaaa tccaccagca 
cgccgaggtt gatcgtcgac gtggtcttcc 
gcaccgatgc gggaccgtgc cggtccagca 
ggccagtggg accgaggccg cgggggttcg 
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gccgggctgg ccgtcgcctc cggcgcggtg 24660 
tacgtgctgc tgatggcgat cgtcggcccg 24720 
cggatgttcc tccccagcca ggcgaagaag 24780 
agcggacggg ccggaggctt tctcaacgct 24840 
aacaccggtg cgcgaacagc ctcagaggcc 24900 
gcggttcaac ggccggatcg cgcccgggcg 24960 
cgtccgcacc aggcgctgca ccgggtagcc 25020 
gttgcggccc tcgtgcaaca cgatctccac 25080 
cttgaaccgg tccaccttca ccagaccgtc 25140 
cccgaggtcc ttcgggatcg accccagcac 25200 
cgacgggtgc atcagccggt tcgccagctc 25260 
ggtgtcctgg tccagccgcc ccacgtggaa 25320 
accgatgcac ggacgcccct tgtcgtccga 25380 
cagcagcagg tgcgtcagat cgtcgttgac 25440 
cgcggtcgtc gggtcgaccc ggcgccccag 25500 
ccgcccctcg acgatcatct cctcggccgc 25560 
cactttctgc agccgcacgc cttccgaatc 25620 
agacgtcatc aatcgaatcc acttcgggca 25680 
acgacgacag ccccagccgc tccaggaaca 25740 
tctccgggtc ggttccggcc tcttcgataa 25800 
cgtcgacgtt cacaccccgt actgccgcga 25860 
cgatcaccgc gagggtctcc agcgcggcac 25920 
ggtaccgctc cacgtacggg gcgtacttct 25980 
cccgccgcag gtcaacgccc cgttccgcct 26040 
cggaccggat ccgggccacc ggctgctcca 26100 
gcacgtcgac cacgagcagc aacgcctcca 26160 
ggtccggcgc actcggcgcc tccaccgcct 26220 
acggttcctg cggctcagcg ggcggcgggg 26280 
cctccggctc cagctccgtc acccgtactc 26340 
ggcttcggcc tccgcctgcg ccacgctgcc 26400 
cggggtctcc tgctcgaaca gcaggacctt 26460 
ccgcgccacc acctcgacgg tgtgcccgca 26520 
gcccttctcc gccaaccgca cccgcagcag 26580 
ctggtggatg tggtccagcg aaacggtcgg 26640 
gatctcggcg aaccgctgcg gcggaacccc 26700 
ccgctcctcg accgacaccg aacgcggata 26760 
gaacagcgcg gccacctgct tgtacgcgcg 26820 
gcgggcctcc agcagcgcca ggtccgcctc 26880 
ggccgccttc aggtctaaca gcgtcgcggc 26940 
gtcccaccgc tcgccgagcg ccttggtgta 27000 
cgctacttcg gtaacgtcca gctggtgctg 27060 
gccctcgaag ttgtccagcc gcaccgtgaa 27120 
ctgcgccgcc ggtaccgcgg ggtctgtcac 27180 
accgagtccg cgccgttctt gtcgaggtcg 27240 
acgcggcctc ggtccacggc gatgccgtgg 27300 
aagcccagca actcctcgtc ggacacgtac 27360 
ccagagccgc cccggcccgc cgcgggtgcc 27420 
tccgtccgag gaacagtagt acggaacaat 27480 
gtcatcgggc gatcacctcg cgcgccagtg 27540 
gcgcccagcg ggtgatcggc tcgccggcca 27600 
tgatcaccgc atcgaagacg atgtcgccga 27660 
gcgagtgcag cgtgcgcggg tcgaacatgg 27720 
ggttcagccg ctcccgcacc ttctcgatgg 27780 
tgaagaactc gcactccagc gggattatca 27840 
tcagcagccc cagcgacggc tggcagtcca 27900 
gctgcagcac ccggtgcagc gtctgctccc 27960 
ccgcggacag gtcgatgttg ctgggcagca 28020 
ccacgtcatg gatggcgacc ggccgttcca 28080 
ggtgcggctg gaccccgagg ccgaccgaga 28140 
gcacccggcg cccgtattcc gccagcgctg 28200 
cgacgccgcc cttctggttg cacatcgcga 28260 
gcggcggctc ggggatgtgc cgacgtggcc 28320 
gcgccaggcc gacctcctcc tcgcggcccg 28380 
Page 67 



ccgtctcccg 
gcggcagcga 
gtggcattcg 
gcgtgtgatc 
ctccgtagcg 
gatcaaggtg 
gcggacatcc 
cgaaaccgag 
ctgccgggaa 
cgcggccagg 
gccgatgggc 
gccgtcgatg 
cagcacttcc 
gagcagcttc 
cggcgcgaca 
caggccgcgg 
cccggtgcct 
gagcccagaa 
ggagtcgaga 
gatcgccccg 
tggggccggt 
gcgtgggcgg 
atccgcaccg 
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ggcgatgctc 
catggcgatc 
ttaccagcgg 
aacgtctacc 
tgtgcgcgcg 
tagacctggg 
gcgccgccct 
ccgttgatcc 
agccgggtac 
ccgggcctgg 
accagccgct 
tcgtcgaggt 
agcagcgccc 
ccgacgtcgc 
tcgcgcgcta 
gcggccacca 
tcccggagct 
atccctgccg 
gtgctgcggg 
cgcaggtccg 
ccggccattc 
cgagcagccc 
cctcggcgag 



aggtcgaccg 
tgactccttt 
caacgcgcct 
gcgtacacca 
gatgcgtcgt 
tcgtcgtgac 
ccacaaggtg 
cggcgcgctg 
cgcgcgaatt 
cccgtaccag 
ggcggccgcc 
ccaacccgac 
ggtcccgcag 
cgatggaaag 
cgtcgaccgg 
aagctcgggc 
ccgccaagaa 
cgaccaggaa 
cggtaccgcg 
gcggcagctc 
ggcatccgcc 
cgccaccccg 
cgggaagcgg 



cgccccggga 
tggtgccggt 
cgccgtggtg 
ccccctcgca 
cgcgtagacc 
cgaggcgtgg 
cgtggcgaac 
cgcggcggtc 
caggaacacc 
gtacgcgtcc 
cttcccttcc 
cgcttcggag 
gccccgggca 
cgccttcggc 
cagcagcccc 
cgacgacgac 
cccggacagg 
ttcggcatac 
ctcgacggca 
ctcgaaaccc 
gggcgaggct 
gggccgttga 
cggatc 



ccgcccttcg 
ggcgatcaaa 
tcggaagttc 
ttcgcagcgt 
tcccgcagcg 
cccagcagtt 
gaatgccgca 
ttcaacgcgt 
gcggcactac 
agcgccgcca 
agcagcacgg 
atccgcgctc 
tcctcgcccc 
agccgcttcg 
tcggcgtgcg 
ggcgccagcg 
tgctccgaac 
cggcgcaggt 
aggtggtcga 
ctcaccgggc 
ccacccggcc 
cgatctcccc 



gcggcgggct 
cgcagcctaa 
cctgtgatct 
caccaagtca 
tgttcaccgt 
cctgcaccac 
gcacgtgcgg 
tccacgagct 
cccgcccgcg 
gcgccggtcg 
tccggttggt 
ccgtcgaata 
cggcgtgatc 
gcgggctcgg 
cgaagcggtg 
cgggccgctg 
gcacctcgcc 
ccctggcgta 
ggtacccggt 
ggcgaagcgc 
ctcgcggacc 
ggccagcgcc 



28440 
28500 
28560 
28620 
28680 
28740 
28800 
28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29736 



<210> 7 
<211> 828 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(825) 

<223> ORFl; o-methyl transferase 

<400> 7 

gtg ttg cca ggt ggc gca cca aca 
vai Leu Pro Gly Gly Ala Pro Thr 
1 5 

gac ctg gtc acg ccg ttg ctg aac 
Asp Leu Val Thr Pro Leu Leu Asn 
20 

ate cac cac gqc tac tgg gag aac 



lie His His Gly Tyr Trp Glu Asn Asp Gly Arg 
35 40 



teg cag cag 
Ser Gin Gin 
10 

teg gtc gcg 
Ser Val Ala 
25 

gac g^g egg 



gee gee gac egg etc acc gac ctt 
Ala Ala Asp Arg Leu Thr Asp Leu 
50 55 

gqc gqc gtt cga ctg etc gat gtg 
Gly Gly val Arg Leu Leu Asp val 
65 70 

ctg cgc gtc gcg cgc gac aac gcg 
Leu Arg Val Ala Arg Asp Asn Ala 
85 

age cag gtg caa gtg gec ate gee 
ser Gin val Gin val Ala lie Ala 
100 



gtc gee gaa 
val Ala Glu 



999 tgc ggt 
Gly Cys Gly 
75 

ate cag ate 
lie Gin lie 
90 

get gat tgc 
Ala Asp Cys 
105 

Page 



gtt ggg 
val Gly 



gqc ggc 
Gly Gly 



get tec 
Ala Ser 
45 

egg acc 
Arg Thr 

acc gga 
Thr Gly 



acc ggc 
Thr Gly 



gca cgc 
Ala Arg 

68 



cag atg tat 48 
Gin Met Tyr 
15 

ccc tgc gec 96 
Pro Cys Ala 
30 

tgg cag cag 144 
Trp Gin Gin 



gtg etc gat 192 
Val Leu Asp 



caa cca gcg 240 
Gin Pro Ala 
80 

ate acc gtc 288 
lie Thr val 
95 

gaa cgc gga 336 
Glu Arg Gly 
110 
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eta age cac egg gtq gac ttc teg tgc gtc gat gec atg tec ctg ccg 384 
Leu Ser His Arg vat Asp Phe Ser Cys val Asp Ala Met Ser Leu Pro 
115 ~ 120 125 

tac ccg gac aat get ttc gac gee gee tgg gee atg cag teg ctg ttg 432 
Tyr Pro Asp Asn Ala Phe Asp Ala Ala Trp Ala Met Gin Ser Leu Leu 
130 135 140 

gag atg tec gaa ccg gac cgt gee ate egg gaa ate ctt cga gta etc 480 
Glu Met Ser Glu Pro Asp Arg Ala lie Arg Glu lie Leu Arg val Leu 
145 150 ~ 155 160 

aaa ccc gqt gqc ate etc gqc gtc ace gag gtc gtc aaa cga gaa gcg 528 
Lys Pro Gly Gly lie Leu Gly val Thr Glu Val val Lys Arg Glu Ala 
165 170 ' 175 



gqc gqc gqg atg ccg gtq tec gqg gac agg tgg ccg ace gqc ctt egg 
Giy Gly Gly Met Pro Val Ser Gly Asp Arg Trp Pro Thr Gly Leu Arg 
180 185 190 



576 



ate tgc ctg get gag caa ctt ctg gaa teg ctg cgt gca gcg gqg ttc 624 
lie Cys Leu Ala Glu Gin Leu Leu Glu Ser Leu Arg Ala Ala Gly Phe 
195 200 205 

gag ate etc gat tgg gag gac gtq teg teg agg acc egg tac ttc atg 672 
Glu lie Leu Asp Trp Glu Asp val ser Ser Arg Thr Arg Tyr Phe Met 
210 215 220 

ccg cag ttc gec gaa gag etc get gcg cac cag cac gqg ate gcg gac 720 
Pro Gin Phe Ala Glu Glu Leu Ala Ala His Gin His Gly lie Ala Asp 
225 230 235 240 

agg tac gqg ccg get gtc gee gqc tgg gee gee gcg gtc tgc gat tat 768 
Arg Tyr Gly Pro Ala val Ala Gly Trp Ala Ala Ala val Cys Asp Tyr 
245 250 255 

gag aaa tat gee cac gac atg gqc tat gcg att ctg acg gcg egg aag 816 
Glu Lys Tyr Ala His Asp Met Gly Tyr Ala lie Leu Thr Ala Arg Lys 
260 265 270 

ccg gtc gqc tga 828 
pro val Gly 
275 

<210> 8 
<211> 275 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 8 

Val Leu Pro Gly Gly Ala Pro Thr ser Gin Gin Val Gly Gin Met Tyr 
1 5 10 15 

Asp Leu val Thr Pro Leu Leu Asn Ser Val Ala Gly Gly Pro Cys Ala 
20 25 30 

lie His His Gly Tyr Trp Glu Asn Asp Gly Arg Ala Ser Trp Gin Gin 
35 40 45 

Ala Ala Asp Arg Leu Thr Asp Leu val Ala Glu Arg Thr Val Leu Asp 
50 " 55 60 
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Gly Gly Val Arg Leu Leu Asp Val Gly Cys Gly Thr Gly Gin Pro Ala 
65 70 75 80 

Leu Arg val Ala Arg Asp Asn Ala lie Gin lie Thr Gly lie Thr val 
85 90 95 

Ser Gin val Gin val Ala lie Ala Ala Asp Cys Ala Arg Glu Arg Gly 
100 105 110 

Leu Ser His Arg val Asp Phe Ser Cys Val Asp Ala Met Ser Leu Pro 
115 120 125 

Tyr Pro Asp Asn Ala Phe Asp Ala Ala Trp Ala Met Gin Ser Leu Leu 
130 135 140 

Glu Met Ser Glu Pro Asp Arg Ala lie Arg Glu lie Leu Arg val Leu 
145 150 ~ 155 ~ 160 

Lys Pro Gly Gly lie Leu Gly val Thr Glu val val Lys Arg Glu Ala 
165 170 175 

Gly Gly Gly Met Pro Val Ser Gly Asp Arg Trp Pro Thr Gly Leu Arg 
180 185 190 

lie Cys Leu Ala Glu Gin Leu Leu Glu Ser Leu Arg Ala Ala Gly Phe 
195 200 205 

Glu lie Leu Asp Trp Glu Asp Val Ser Ser Arg Thr Arg Tyr Phe Met 
210 215 220 

Pro Gin Phe Ala Glu Glu Leu Ala Ala His Gin His Gly lie Ala Asp 
225 230 235 240 

Arg Tyr Gly Pro Ala val Ala Gly Trp Ala Ala Ala Val Cys Asp Tyr 
245 250 255 

Glu Lys Tyr Ala His Asp Met Gly Tyr Ala lie Leu Thr Ala Arg Lys 
260 265 270 

pro val Gly 
275 



<210> 9 
<211> 1173 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(1170) 

<223> ORF2; glycosyltransferase 

<400> 9 

atg cgc gta etc gtc gtt ccc ttg ccc tat ccg acg cat etc atg gca 48 

Met Arg Val Leu val val Pro Leu Pro Tyr Pro Thr His Leu Met Ala 
1 5 10 15 

atg gtq ccg ctg tgc tgg gcg ctg caa gca tec gqg cac gag gtc ctg 96 

Met val Pro Leu Cys Trp Ala Leu Gin Ala Ser Gly His Glu val Leu 

20 25 30 
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ate gec gca cca cca gag ctg cag gcg acc gcg cat ggt gca gqt etc 

lie Ala Ala Pro Pro Glu Leu Gin Ala Thr Ala His Gly Ala Gly Leu 
35 40 45 

acc acg gee gqg ate cgc gqg aac gac agg acc gqc gat acg gqt gqa 

Thr Thr Ala Gly lie Arg Gly Asn Asp Arg Thr Gly Asp Thr Gly Gly 
50 55 60 



gqc gca ccg gtc cag tac gtq ccg tac aac gqa age gqc gca ttc ccg 
Gly Ala Pro val Gin Tyr val Pro Tyr Asn Gly Ser Gly Ala Phe Pro 
195 200 205 



144 



192 



acc acg cag ctg cgc ttt ccc aat ccg gcg ttc gqt cag cgc gac acc 240 
Thr Thr Gin Leu Arg Phe Pro Asn Pro Ala Phe Gly Gin Arg Asp Thr 
65 70 75 80 

gag gca gqc egg caa ctg tgg gag cag acc gcg tec aat gtc gcg caa 288 
Glu Ala Gly Arg Gin Leu Trp Glu Gin Thr Ala Ser Asn val Ala Gin 
85 90 95 

age teg etc gat cag etc ccc gaa tac ctt cga ctg gee gag gee tgg 336 
Ser Ser Leu Asp Gin Leu Pro Glu Tyr Leu Arg Leu Ala Glu Ala Trp 
100 105 110 

cga ccg tea gtq ctg ttg gtc gac gtc tgc gcg ctg ate gqc egg gtq 384 
Arg Pro Ser val Leu Leu Val Asp val Cys Ala Leu lie Gly Arg Val 
115 120 125 

etc gqc gqa ttg etc gac ctg ccg gtc gtq ctg cac cgc tgg gqa gtc 432 
Leu Gly Gly Leu Leu Asp Leu Pro Val val Leu His Arg Trp Gly val 
130 135 140 

gac ccc acc gca gqc ccc ttc age gat cga gee cac gag ttg ctt gac 480 
Asp Pro Thr Ala Gly Pro Phe Ser Asp Arg Ala His Glu Leu Leu Asp 
145 150 155 160 

ccg gtq tgc egg cac cac gqa ctg acc gqc ctg ccc act ccc gag etc 528 
Pro val Cys Arg His His Gly Leu Thr Gly Leu Pro Thr Pro Glu Leu 
165 170 175 

ate etc gat ccc tgt ccg ccg age ctg caa gca age gac gcg ccg caa 576 
lie Leu Asp Pro Cys Pro Pro Ser Leu Gin Ala ser Asp Ala Pro Gin 
180 185 190 



624 



gca tgg gqc gcg gcg cgc acc tea gca egg egg gtc tgc ate tgc atg 672 

Ala Trp Gly Ala Ala Arg Thr ser Ala Arg Arg val Cys lie Cys Met 

210 215 220 

gqc cgc atg gtq ctg aac gee acc gqg ccg get ccg ctg ctg cgc gca 720 

Gly Arg Met val Leu Asn Ala Thr Gly Pro Ala Pro Leu Leu Arg Ala 
225 230 235 240 

gta gcg get gee acc gag ttg ccc gqc gtc gag gec gtq ate gee gtt 768 

val Ala Ala Ala Thr Glu Leu Pro Gly val Glu Ala val lie Ala Val 
245 250 255 

ccc ccc gag cac egg gca ctt etc acc gac eta ccc gac aac gee egg 816 

Pro Pro Glu His Arg Ala Leu Leu Thr Asp Leu Pro Asp Asn Ala Arg 

260 265 270 

ate gec gaa teg gtc ccg etc aac ctg ttc ctg cgt acc tgc gag ctg 864 

lie Ala Glu Ser Val Pro Leu Asn Leu Phe Leu Arg Thr Cys Glu Leu 
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275 280 285 

gtc ate tgc gcg gqc gqc teg gqa acg gca ttc acc gcg acc cga etc 912 
Val lie Cys Ala Gly Gly Ser Gly Thr Ala Phe Thr Ala Thr Arg Leu 
290 295 300 

gqc ate ccg caa etc gtq ctt ccc cag tac ttc gac cag ttc gac tac 960 
Gly lie Pro Gin Leu Val Leu Pro Gin Tyr Phe Asp Gin Phe Asp Tyr 
305 310 315 320 

gcg cgc aac etc gee get gee gqg gcg gqc ate tgc ttg ccg gat gag 1008 
Ala Arg Asn Leu Ala Ala Ala Gly Ala Gly lie Cys Leu Pro Asp Glu 
325 330 335 

cag gee cag tec gac cac gaa cag ttc acc gac tea ate gca acg gtq 1056 
Gin Ala Gin Ser Asp His Glu Gin Phe Thr Asp Ser lie Ala Thr Val 
340 345 350 

etc gqc gac acc gqc ttc get get gcg gca ate aaa etc age gac gag 1104 
Leu Gly Asp Thr Gly Phe Ala Ala Ala Ala lie Lys Leu Ser Asp Glu 
355 360 365 

ate acg gee atg ccc cat ccc gec gcg ctg gtq egg acg ctg gag aac 1152 
lie Thr Ala Met Pro His Pro Ala Ala Leu Val Arg Thr Leu Glu Asn 
370 375 380 

act gcg gec ate cgt gee tga 1173 
Thr Ala Ala lie Arg Ala 
385 390 

<210> 10 
<211> 390 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 10 

Met Arg val Leu val val Pro Leu Pro Tyr Pro Thr His Leu Met Ala 
15 10 15 

Met val Pro Leu Cys Trp Ala Leu Gin Ala Ser Gly His Glu val Leu 
20 25 30 

lie Ala Ala Pro Pro Glu Leu Gin Ala Thr Ala His Gly Ala Gly Leu 
35 40 45 

Thr Thr Ala Gly lie Arg Gly Asn Asp Arg Thr Gly Asp Thr Gly Gly 
50 55 60 

Thr Thr Gin Leu Arg Phe Pro Asn Pro Ala Phe Gly Gin Arg Asp Thr 
65 70 75 80 

Glu Ala Gly Arg Gin Leu Trp Glu Gin Thr Ala Ser Asn val Ala Gin 
85 90 95 

Ser Ser Leu Asp Gin Leu Pro Glu Tyr Leu Arg Leu Ala Glu Ala Trp 
100 105 ~ 110 

Arg Pro Ser val Leu Leu val Asp val Cys Ala Leu lie Gly Arg val 
115 120 125 

Leu Gly Gly Leu Leu Asp Leu Pro Val Val Leu His Arg Trp Gly Val 
130 135 140 
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Asp Pro Thr Ala Gly Pro Phe Ser Asp Arg Ala His Glu Leu Leu Asp 
145 150 155 160 

Pro Val cys Arg His His Gly Leu Thr Gly Leu Pro Thr Pro Glu Leu 
165 170 175 

lie Leu Asp Pro Cys Pro Pro Ser Leu Gin Ala Ser Asp Ala Pro Gin 
180 185 190 

Gly Ala Pro val Gin Tyr val Pro Tyr Asn Gly Ser Gly Ala Phe Pro 
195 200 205 

Ala Trp Gly Ala Ala Arg Thr Ser Ala Arg Arg val Cys lie Cys Met 
210 215 ~ 220 

Gly Arg Met val Leu Asn Ala Thr Gly Pro Ala Pro Leu Leu Arg Ala 
225 " 230 235 240 

val Ala Ala Ala Thr Glu Leu Pro Gly val Glu Ala Val lie Ala val 
245 250 255 

Pro Pro Glu His Arg Ala Leu Leu Thr Asp Leu Pro Asp Asn Ala Arg 
260 265 270 

lie Ala Glu Ser Val Pro Leu Asn Leu Phe Leu Arg Thr Cys Glu Leu 
275 280 285 

Val lie Cys Ala Gly Gly Ser Gly Thr Ala Phe Thr Ala Thr Arg Leu 
290 295 300 

Gly lie Pro Gin Leu Val Leu Pro Gin Tyr Phe Asp Gin Phe Asp Tyr 
305 310 315 320 

Ala Arg Asn Leu Ala Ala Ala Gly Ala Gly lie Cys Leu Pro Asp Glu 
325 330 335 

Gin Ala Gin Ser Asp His Glu Gin Phe Thr Asp Ser lie Ala Thr Val 
340 345 350 

Leu Gly Asp Thr Gly Phe Ala Ala Ala Ala lie Lys Leu Ser Asp Glu 
355 360 365 

lie Thr Ala Met Pro His Pro Ala Ala Leu val Arg Thr Leu Glu Asn 
370 375 380 

Thr Ala Ala lie Arg Ala 
385 390 



<210> 11 
<211> 753 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(750) 

<223> 0RF3; O-methyl transferase 

<400> 11 

atg ccc tec cag aac geg ctg tac ctg gac ctg etc aag aag gta etc 
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Met Pro ser Gin Asn Ala Leu Tyr Leu Asp Leu Leu Lys Lys Val Leu 
15 10 15 

acc aac acg att tac agt gat egg ccg cat ccg aac gec tgg cag gac 96 
Thr Asn Thr lie Tyr Ser Asp Arg Pro His Pro Asn Ala Trp Gin Asp 
20 25 30 

aac acc gac tac agg cag gec get egg gec aaa ggc acg gac tgg cca 144 
Asn Thr Asp Tyr Arg Gin Ala Ala Arg Ala Lys Gly Thr Asp Trp Pro 
35 40 45 

act gtc gcg cac acg atg ate ggt ctg gag egg ctg gac aac etc cag 192 
Thr val Ala His Thr Met lie Gly Leu Glu Arg Leu Asp Asn Leu Gin 
50 55 60 

cac tgc gtg gaa gec gtg etc gca gac ggt gtt ccc ggg gat ttc gec 240 
His cys val Glu Ala Val Leu Ala Asp Gly Val Pro Gly Asp Phe Ala 
65 70 75 80 

gag acc ggt gtc tgg egg ggc ggc gca tgc ate ttc atg cgc gcg gtt 288 
Glu Thr Gly val Trp Arg Gly Gly Ala Cys lie Phe Met Arg Ala val 
85 90 95 

etc cag gca ttc gga gat acc gga cgt acc gtc tgg gta gtg gat tec 336 
Leu Gin Ala Phe Gly Asp Thr Gly Arg Thr Val Trp Val Val Asp Ser 
100 105 110 

ttc cag gga atg ccg gaa age tct gcg caa gac cac caa gcg gac cag 384 
Phe Gin Gly Met Pro Glu Ser Ser Ala Gin Asp His Gin Ala Asp Gin 
115 120 125 

get atg gcg ctg cac gag tac aac gac gtg ctt ggc gta teg ctt gag 432 
Ala Met Ala Leu His Glu Tyr Asn Asp Val Leu Gly Val Ser Leu Glu 
130 135 140 

acc gtc egg cag aac ttc gec cgc tac ggg ctg etc gac gaa cag gtc 480 
Thr val Arg Gin Asn Phe Ala Arg Tyr Gly Leu Leu Asp Glu Gin Val 
145 150 ~ " 155 160 

agg ttc etc ccc ggc tgg ttc egg gac acc ttg ccc acc gec ccc ate 528 
Arg Phe Leu Pro Gly Trp Phe Arg Asp Thr Leu Pro Thr Ala Pro lie 
165 170 175 

cag gaa etc gec gtg eta cga etc gac ggc gac etc tac gaa tec aca 576 
Gin Glu Leu Ala val Leu Arg Leu Asp Gly Asp Leu Tyr Glu ser Thr 
180 185 190 

atg gac tea ttg egg aac ctg tac ccg aag etc teg ccg ggc gga ttc 624 
Met Asp Ser Leu Arg Asn Leu Tyr Pro Lys Leu Ser Pro Gly Gly Phe 
195 200 205 

gtc ate ate gac gac tat ttt ctg ccg tec tgc cag gac gcg gtg aag 672 
val He lie Asp Asp Tyr Phe Leu Pro Ser Cys Gin Asp Ala val Lys 
210 215 220 

ggg ttc cgc gcg gaa etc ggg ate acg gaa ccc ate cac gac ate gac 720 
Gly Phe Arg Ala Glu Leu Gly lie Thr Glu Pro lie His Asp lie Asp 
225 " 230 235 240 

ggc acg ggc gee tac tgg cgc cgc age tgg tga 753 
Gly Thr Gly Ala Tyr Trp Arg Arg Ser Trp 
245 ~ 250 
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<210> 12 
<211> 250 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 12 

Met Pro Ser Gin Asn Ala Leu Tyr Leu Asp Leu Leu Lys Lys Val Leu 
1 5 "10 15 

Thr Asn Thr lie Tyr Ser Asp Arg Pro His Pro Asn Ala Trp Gin Asp 
20 25 30 

Asn Thr Asp Tyr Arg Gin Ala Ala Arg Ala Lys Gly Thr Asp Trp Pro 
35 ~ 40 ~ 45 

Thr Val Ala His Thr Met lie Gly Leu Glu Arg Leu Asp Asn Leu Gin 
50 55 60 

His Cys val Glu Ala val Leu Ala Asp Gly val Pro Gly Asp Phe Ala 
65 70 75 80 

Glu Thr Gly Val Trp Arg Gly Gly Ala Cys lie Phe Met Arg Ala Val 
85 90 95 

Leu Gin Ala Phe Gly Asp Thr Gly Arg Thr val Trp Val val Asp Ser 
100 105 110 

Phe Gin Gly Met Pro Glu Ser Ser Ala Gin Asp His Gin Ala Asp Gin 
115 120 125 

Ala Met Ala Leu His Glu Tyr Asn Asp val Leu Gly val ser Leu Glu 
130 135 140 

Thr Val Arg Gin Asn Phe Ala Arg Tyr Gly Leu Leu Asp Glu Gin Val 
145 " 150 " 155 160 

Arg Phe Leu Pro Gly Trp Phe Arg Asp Thr Leu Pro Thr Ala Pro lie 
165 170 175 

Gin Glu Leu Ala val Leu Arg Leu Asp Gly Asp Leu Tyr Glu Ser Thr 
180 185 190 

Met Asp Ser Leu Arg Asn Leu Tyr Pro Lys Leu Ser Pro Gly Gly Phe 
195 200 205 

Val lie lie Asp Asp Tyr Phe Leu Pro Ser Cys Gin Asp Ala val Lys 
210 215 220 

Gly Phe Arg Ala Glu Leu Gly lie Thr Glu Pro lie His Asp lie Asp 
225 " 230 235 240 

Gly Thr Gly Ala Tyr Trp Arg Arg Ser Trp 
245 " 250 



<210> 13 
<211> 1188 
<212> DNA 

<213> Saccharopolyspora spinosa 
<220> 
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<221> CDS 

<222> (1)..(1185) 

<223> 0RF4; O-methyl transferase 

<400> 13 

atg agt gag ate gca gtt gec ccc tgg teg gtg gtg gag cgt ttg ctg 48 

Met Ser Glu lie Ala val Ala Pro Trp ser val val Glu Arg Leu Leu 
15 10 15 



etc gcg gcg ggt gcg ggc ccg gcg aag etc cag gaa gca gtq cag gtq 
Leu Ala Ala Gly Ala Gly Pro Ala Lys Leu Gin Glu Ala val Gin val 
20 25 30 



egg aac agg gtq tec ace gcg gtq cac gee gtq ctg gee gca gee gee 
Arg Asn Arg val Ser Thr Ala val His Ala val Leu Ala Ala Ala Ala 
145 150 155 160 



ggt tat cac gca ccc gaa etc ggt ggt get teg ctg cgc atg tgg cag 
Gly Tyr His Ala Pro Glu Leu Gly Gly Ala Ser Leu Arg Met Trp Gin 
210 215 220 



96 



gee gga ctg gac gcg gtg gee gac gee ate gtc gac gaa etc gtc gta 144 
Ala Gly Leu Asp Ala val Ala Asp Ala lie val Asp Glu Leu val val 
35 40 45 

cgc tgc gat ccg ctg teg ttg gac gag teg gtg cga ate ggc ctg gag 192 
Arg Cys Asp Pro Leu Ser Leu Asp Glu Ser Val Arg lie Gly Leu Glu 
50 55 60 

ate act tct ggc get cag ctg gtc egg aga acc gtt gag etc gat cac 240 
lie Thr ser Gly Ala Gin Leu val Arg Arg Thr val Glu Leu Asp His 
65 70 " 75 80 

gca ggc ctg egg etc gcg gcg gtc gee gaa gca get get gtt etc egg 288 
Ala Gly Leu Arg Leu Ala Ala val Ala Glu Ala Ala Ala val Leu Arg 
85 90 95 

ttc gac gcg gtg gat ctg ctg gaa ggg etc ttc ggc ccg gtt gac ggc 336 
Phe Asp Ala val Asp Leu Leu Glu Gly Leu Phe Gly Pro val Asp Gly 
100 105 110 

agg egg cac aac age cgt gaa gtc cgc tgg teg gac age atg acg cag 384 
Arg Arg His Asn Ser Arg Glu Val Arg Trp Ser Asp Ser Met Thr Gin 
115 120 125 

ttc teg ccc gac cag ggc etc gee ggc gcg cag cgc ctg ctg gcg ttc 432 
Phe Ser Pro Asp Gin Gly Leu Ala Gly Ala Gin Arg Leu Leu Ala Phe 
130 135 140 



480 



acc agg cgc gcg gac etc ggt gcg ctg gca gtc cgc tac gga tec gac 528 
Thr Arg Arg Ala Asp Leu Gly Ala Leu Ala Val Arg Tyr Gly ser Asp 
165 170 ^ 175 

aaa tgg gcg gac ctg cac tgg tac acc gaa cac tac gag cac cac ttc 576 
Lys Trp Ala Asp Leu His Trp Tyr Thr Glu His Tyr Glu His His Phe 
180 185 190 

tec cga ttc cag gat gec ccg gtg cga gtg ttg gaa ata gga ate ggt 624 
ser Arg Phe Gin Asp Ala Pro val Arg val Leu Glu lie Gly lie Gly 
195 200 205 



672 



egg tac ttc egg cga ggt etc gtt tac ggg ctg gac att ttc gag aaa 720 
Arg Tyr Phe Arg Arg Gly Leu val Tyr Gly Leu Asp lie Phe Glu Lys 
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225 230 235 240 

gcc ggg aac gaa ggg cac cga gtg cga aag ctg cga gqt gac cag age 768 
Ala Gly Asn Glu Gly His Arg val Arg Lys Leu Arg Gly Asp Gin Ser 
245 250 ~ ' 255 

gat gcg gaa ttc ctg gaa gac atg gcg gqg aag ate gqg ccg ttc gac 816 
Asp Ala Glu Phe Leu Glu Asp Met Ala Gly Lys lie Gly Pro Phe Asp 
260 265 270 

att gtc ate gac gac gqc age cat gtc aac gac cac gtc aag aaa tec 864 
lie val lie Asp Asp Gly ser His val Asn Asp His val Lys Lys Ser 
275 280 285 

ttc caa tec ctg ttt ccg cac gtc cgc cca gqt gqt ttg tac gtc ate 912 
Phe Gin Ser Leu Phe Pro His Val Arg Pro Gly Gly Leu Tyr Val lie 
290 295 300 

gag gat etc cag acg gcg tac tgg ccc gqc tac gqc gqt cgc gat gqg 960 
Glu Asp Leu Gin Thr Ala Tyr Trp Pro Gly Tyr Gly Gly Arg Asp Gly 
305 310 315 320 

gaa ccc gcg gcc cag cgc acc teg ate gac atg etc aaa gaa ctg ate 1008 
Glu Pro Ala Ala Gin Arg Thr ser lie Asp Met Leu Lys Glu Leu lie 
325 330 335 

gac gqc ctg cat tat cag gag cgc gaa teg egg tgc gqg acc gag ccc 1056 
Asp Gly Leu His Tyr Gin Glu Arg Glu Ser Arg Cys Gly Thr Glu Pro 
340 345 350 

tec tac acg gaa egg aac gtg gcg gcc ctg cac ttc tac cac aac ctg 1104 
Ser Tyr Thr Glu Arg Asn val Ala Ala Leu His Phe Tyr His Asn Leu 
355 360 365 

gta ttc gtg gag aaa ggg etc aac get gag cct gcc gcg ccg ggg ttc 1152 
Val Phe Val Glu Lys Gly Leu Asn Ala Glu Pro Ala Ala Pro Gly Phe 
370 375 380 

gtg ccc egg caa gcg etc gqc gtc gag gac ggc tga 1188 
val Pro Arg Gin Ala Leu Gly val Glu Asp Gly 
385 390 395 

<210> 14 
<211> 395 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 14 

Met Ser Glu lie Ala val Ala Pro Trp Ser val Val Glu Arg Leu Leu 
15 10 15 

Leu Ala Ala Gly Ala Gly Pro Ala Lys Leu Gin Glu Ala Val Gin Val 
20 25 30 

Ala Gly Leu Asp Ala val Ala Asp Ala lie val Asp Glu Leu val Val 
35 40 45 

Arg Cys Asp Pro Leu Ser Leu Asp Glu ser Val Arg lie Gly Leu Glu 
50 55 60 

He Thr ser Gly Ala Gin Leu val Arg Arg Thr Val Glu Leu Asp His 
65 70 ~ 75 80 
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Ala Gly Leu Arg Leu Ala Ala val Ala Glu Ala Ala Ala Val Leu Arg 
85 90 95 

Phe Asp Ala val Asp Leu Leu Glu Gly Leu Phe Gly Pro val Asp Gly 
100 105 110 

Arg Arg His Asn Ser Arg Glu Val Arg Trp ser Asp Ser Met Thr Gin 
115 ~ 120 ~ 125 

Phe Ser Pro Asp Gin Gly Leu Ala Gly Ala Gin Arg Leu Leu Ala Phe 
130 135 ' 140 

Arg Asn Arg Val Ser Thr Ala Val His Ala val Leu Ala Ala Ala Ala 
145 150 155 160 

Thr Arg Arg Ala Asp Leu Gly Ala Leu Ala Val Arg Tyr Gly Ser Asp 
165 170 ~ 175 

Lys Trp Ala Asp Leu His Trp Tyr Thr Glu His Tyr Glu His His Phe 
180 185 190 

Ser Arg Phe Gin Asp Ala Pro val Arg val Leu Glu lie Gly lie Gly 
195 200 205 

Gly Tyr His Ala Pro Glu Leu Gly Gly Ala Ser Leu Arg Met Trp Gin 
210 215 220 

Arg Tyr Phe Arg Arg Gly Leu Val Tyr Gly Leu Asp lie Phe Glu Lys 
225 230 235 240 

Ala Gly Asn Glu Gly His Arg Val Arg Lys Leu Arg Gly Asp Gin Ser 
245 250 ~ ' 255 

Asp Ala Glu Phe Leu Glu Asp Met Ala Gly Lys lie Gly Pro Phe Asp 
260 265 270 

lie val lie Asp Asp Gly Ser His val Asn Asp His Val Lys Lys Ser 
275 280 285 

Phe Gin Ser Leu Phe Pro His val Arg Pro Gly Gly Leu Tyr Val lie 
290 295 300 

Glu Asp Leu Gin Thr Ala Tyr Trp Pro Gly Tyr Gly Gly Arg Asp Gly 
305 310 315 320 

Glu Pro Ala Ala Gin Arg Thr Ser lie Asp Met Leu Lys Glu Leu lie 
325 330 335 

Asp Gly Leu His Tyr Gin Glu Arg Glu Ser Arg Cys Gly Thr Glu Pro 
340 345 350 

Ser Tyr Thr Glu Arg Asn Val Ala Ala Leu His Phe Tyr His Asn Leu 
355 360 365 

val Phe val Glu Lys Gly Leu Asn Ala Glu Pro Ala Ala Pro Gly Phe 
370 375 380 

Val Pro Arg Gin Ala Leu Gly Val Glu Asp Gly 
385 390 395 
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<210> 15 
<211> 1620 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (1617) 

<223> ORF5; c-c linking enzyme, cyclization enzyme 
<400> 15 

atg ate teg get gcg gqc gaa caa agt gqa cca gtc aga aaa gqa gqg 48 
Met lie Ser Ala Ala Gly Glu Gin Ser Gly Pro val Arg Lys Gly Gly 
15 10 15 

gcg gtq ccc gaa ttc cat gac ccg gca ccc atg aat cgt cga acc cca 96 
Ala val Pro Glu Phe His Asp Pro Ala Pro Met Asn Arg Arg Thr Pro 
20 25 30 

gqa aca gag ate acc gtc gag ccc gac gat cct cgt tat ccg gac etc 144 
Gly Thr Glu lie Thr val Glu Pro Asp Asp Pro Arg Tyr Pro Asp Leu 
35 40 45 

gtc gtc gqg cac aac ccc cgt ttc acc gqa aaa ccc gaa cgc ate cac 192 
val Val Gly His Asn Pro Arg Phe Thr Gly Lys Pro Glu Arg lie His 
50 55 60 

ate gee age tec gee gaa gac gtc gtq cac gec gtc gec gac gee gtq 240 
lie Ala Ser Ser Ala Glu Asp Val Val His Ala Val Ala Asp Ala Val 
65 70 75 80 

cgc acc gqc agg egg gta gqg gtc cgc age gqc gqg cac tgc ttc gag 288 
Arg Thr Gly Arg Arg val Gly val Arg Ser Gly Gly His Cys Phe Glu 
85 90 95 

aat etc gtt gcg gac ccg gcg ate cga gtq etc gtc gac etc tec gag 336 
Asn Leu Val Ala Asp Pro Ala lie Arg Val Leu Val Asp Leu Ser Glu 
100 105 110 

etc aac cgc gtq tac tac gac age acg cgc gqg gca ttc gcg ate gag 384 
Leu Asn Arg Val Tyr Tyr Asp Ser Thr Arg Gly Ala Phe Ala lie Glu 
115 120 " " 125 

gcg gqc gee gec etc gqg cag gtq tac cga acc ctg ttc aag aac tgg 432 
Ala Gly Ala Ala Leu Gly Gin val Tyr Arg Thr Leu Phe Lys Asn Trp 
130 135 " 140 

gqc gtq acg ate ccg acc gqc gca tgt ccc gqg gtq gqc gca gqc gqg 480 
Gly val Thr lie Pro Thr Gly Ala Cys Pro Gly Val Gly Ala Gly Gly 
145 150 155 160 



cac ate etc gqc gqg gqa tac gqc ccg ctg teg cgc cga ttc gqt teg 
His lie Leu Gly Gly Gly Tyr Gly Pro Leu Ser Arg Arg Phe Gly Ser 
165 170 175 



528 



gtc gtc gac tac ctt caa gqc gtc gag gtc gtc gtq gtc gac cag gec 576 

val Val Asp Tyr Leu Gin Gly Val Glu Val Val val Val Asp Gin Ala 

180 185 190 

gqt gaa gtq cac ate gtc gag gee gac egg aac tec acg gqc gee gqt 624 

Gly Glu Val His lie val Glu Ala Asp Arg Asn Ser Thr Gly Ala Gly 
195 200 205 
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cac gac ttg tgg tgg gcg cac acc gqt gqc gqt gqc gqc aac ttc gqg 
His Asp Leu Trp Trp Ala His Thr Gly Gly Gly Gly Gly Asn Phe Gly 
210 215 220 



672 



ate gtc acc agg ttt tgg etc cga acg ccg gac gtq gtc age acc gac 720 

lie val Thr Arg Phe Trp Leu Arg Thr pro Asp val Val Ser Thr Asp 

225 230 235 240 

gec gca gag etc ctg cca egg ccg ccc gcg aca gtq ctg etc cga teg 768 

Ala Ala Glu Leu Leu Pro Arg Pro Pro Ala Thr val Leu Leu Arg Ser 
245 250 255 

ttc cac tgg ccg tgg cac gaa ctg aca gag cag tea ttc gee gtq etc 816 

Phe His Trp Pro Trp His Glu Leu Thr Glu Gin Ser Phe Ala Val Leu 
260 265 270 

eta cag aac ttc gqc aat tgg tac gag cag cac age gcg cct gaa tec 864 

Leu Gin Asn Phe Gly Asn Trp Tyr Glu Gin His Ser Ala Pro Glu Ser 
275 280 285 

acg caa etc gqg ttg ttc age acg etc gtc tgc gca cac egg caa get 912 

Thr Gin Leu Gly Leu Phe Ser Thr Leu val Cys Ala His Arg Gin Ala 
290 295 300 

gqc tac gtc acg ctg aac gtt cac ctg gac gqc acg gat ccg aac gcg 960 

Gly Tyr val Thr Leu Asn val His Leu Asp Gly Thr Asp Pro Asn Ala 

305 310 315 320 

gaa cgc acc ctg gec gaa cac ctg teg gcg ate aac gee cag gtc gqc 1008 

Glu Arg Thr Leu Ala Glu His Leu Ser Ala lie Asn Ala Gin Val Gly 
325 330 335 

gtq act cca gec gaa ggg ctg egg gaa acc ctg ccg tgg ttg cga teg 1056 

Val Thr Pro Ala Glu Gly Leu Arg Glu Thr Leu Pro Trp Leu Arg ser 
340 345 350 



acc cag gtq gec gqg gcg ate gec gaa gqc gqc gaa ccg ggc atg caa 
Thr Gin Val Ala Gly Ala lie Ala Glu Gly Gly Glu Pro Gly Met Gin 
355 360 365 



1104 



egg acc aag gtc aaa gee gee tac ttg cgc acc gqg ctg tec gaa get 1152 

Arg Thr Lys Val Lys Ala Ala Tyr Leu Arg Thr Gly Leu Ser Glu Ala 
370 375 380 

caa eta gee acg gtt tac egg egg ctg acc gtc tac gqa tac gac aac 1200 

Gin Leu Ala Thr val Tyr Arg Arg Leu Thr Val Tyr Gly Tyr Asp Asn 
385 390 395 400 

cct gcg gcg gcg ctg ttg ctg etc gqt tac gqc gqt atg gcg aat gee 1248 

Pro Ala Ala Ala Leu Leu Leu Leu Gly Tyr Gly Gly Met Ala Asn Ala 
405 410 415 

gtq get ccg teg gec acc gca etc get cag cgc gac teg gtt etc aaa 1296 

val Ala Pro Ser Ala Thr Ala Leu Ala Gin Arg Asp ser val Leu Lys 

420 425 ~ 430 

gcg ctg ttc gtc acg aac tgg teg gag ccc gec gag gac gag egg cat 1344 

Ala Leu Phe Val Thr Asn Trp ser Glu Pro Ala Glu Asp Glu Arg His 

435 440 445 

ctg acc tgg att cgc gqt ttc tac cgc gag atg tac gec gaa acc gqc 1392 

Leu Thr Trp lie Arg Gly Phe Tyr Arg Glu Met Tyr Ala Glu Thr Gly 
450 455 " 460 
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gga gtt ccg gtq cca ggt acc cgt gtc gac gqc tec tac ate aac tac 1440 

Gly val Pro val Pro Gly Thr Arg val Asp Gly ser Tyr lie Asn Tyr 
465 470 - 475 * 480 

ccg gac acc gac ttg gec gat cca ttg tgg aac acc tec ggt gtt gec 1488 
Pro Asp Thr Asp Leu Ala Asp Pro Leu Trp Asn Thr Ser Gly Val Ala 
485 490 495 

tgg cac gac ctg tac tac aaa gac aac tac ccg egg ctg cag egg gee 1536 
Trp His Asp Leu Tyr Tyr Lys Asp Asn Tyr Pro Arg Leu Gin Arg Ala 
500 505 510 

aaa gcg egg tgg gat ccg cag aac ate ttc cag cac ggc ctg teg ate 1584 
Lys Ala Arg Trp Asp Pro Gin Asn lie Phe Gin His Gly Leu ser lie 
515 520 525 

aaa ccg ccg gca egg ctt tea ccc ggt cag cca tga 1620 
Lys Pro Pro Ala Arg Leu Ser Pro Gly Gin Pro 
530 ~ 535 

<210> 16 
<211> 539 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 16 

Met lie Ser Ala Ala Gly Glu Gin ser Gly Pro val Arg Lys Gly Gly 
15 10 15 

Ala val Pro Glu Phe His Asp Pro Ala Pro Met Asn Arg Arg Thr Pro 
20 25 30 

Gly Thr Glu lie Thr Val Glu Pro Asp Asp Pro Arg Tyr Pro Asp Leu 
35 40 45 

val val Gly His Asn Pro Arg Phe Thr Gly Lys Pro Glu Arg lie His 
50 55 60 

lie Ala Ser ser Ala Glu Asp Val Val His Ala val Ala Asp Ala val 
65 70 75 80 

Arg Thr Gly Arg Arg val Gly Val Arg ser Gly Gly His Cys Phe Glu 
85 90 95 

Asn Leu val Ala Asp Pro Ala lie Arg val Leu val Asp Leu Ser Glu 
100 105 110 

Leu Asn Arg val Tyr Tyr Asp Ser Thr Arg Gly Ala Phe Ala lie Glu 
115 120 ~ 125 

Ala Gly Ala Ala Leu Gly Gin Val Tyr Arg Thr Leu Phe Lys Asn Trp 
130 135 * ~ 140 

Gly val Thr lie Pro Thr Gly Ala cys Pro Gly Val Gly Ala Gly Gly 
145 150 155 160 

His lie Leu Gly Gly Gly Tyr Gly Pro Leu ser Arg Arg Phe Gly Ser 
165 170 ~ ^ 175 

val val Asp Tyr Leu Gin Gly Val Glu val val val Val Asp Gin Ala 
180 185 190 
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Gly Glu 



val 
195 



His 



lie 



val 



Glu 



Ala 
200 



Asp 



Arg 



Asn 



Ser 



Thr 
205 



Gly 



Ala Gly 



His Asp Leu Trp Trp Ala His Thr Gly Gly Gly Gly Gly Asn Phe Gly 
210 215 220 

lie val Thr Arg Phe Trp Leu Arg Thr Pro Asp Val Val Ser Thr Asp 
225 230 235 240 

Ala Ala Glu Leu Leu Pro Arg Pro Pro Ala Thr Val Leu Leu Arg ser 
245 250 255 

Phe His Trp Pro Trp His Glu Leu Thr Glu Gin Ser Phe Ala val Leu 
260 265 270 

Leu Gin Asn Phe Gly Asn Trp Tyr Glu Gin His Ser Ala Pro Glu Ser 
275 280 285 

Thr Gin Leu Gly Leu Phe Ser Thr Leu Val Cys Ala His Arg Gin Ala 
290 295 300 

Gly Tyr val Thr Leu Asn Val His Leu Asp Gly Thr Asp Pro Asn Ala 
305 310 315 320 

Glu Arg Thr Leu Ala Glu His Leu ser Ala lie Asn Ala Gin val Gly 
325 330 335 

Val Thr Pro Ala Glu Gly Leu Arg Glu Thr Leu Pro Trp Leu Arg Ser 
340 345 350 

Thr Gin val Ala Gly Ala lie Ala Glu Gly Gly Glu Pro Gly Met Gin 
355 360 365 

Arg Thr Lys val Lys Ala Ala Tyr Leu Arg Thr Gly Leu Ser Glu Ala 
370 375 ~ 380 

Gin Leu Ala Thr Val Tyr Arg Arg Leu Thr val Tyr Gly Tyr Asp Asn 
385 390 395 400 

Pro Ala Ala Ala Leu Leu Leu Leu Gly Tyr Gly Gly Met Ala Asn Ala 
405 410 415 

Val Ala Pro Ser Ala Thr Ala Leu Ala Gin Arg Asp Ser Val Leu Lys 
420 425 ' 430 

Ala Leu Phe val Thr Asn Trp Ser Glu Pro Ala Glu Asp Glu Arg His 
435 440 445 

Leu Thr Trp lie Arg Gly Phe Tyr Arg Glu Met Tyr Ala Glu Thr Gly 
450 455 460 

Gly val Pro val Pro Gly Thr Arg val Asp Gly Ser Tyr lie Asn Tyr 
465 470 ~ 475 480 

Pro Asp Thr Asp Leu Ala Asp Pro Leu Trp Asn Thr Ser Gly Val Ala 
485 490 495 

Trp His Asp Leu Tyr Tyr Lys Asp Asn Tyr pro Arg Leu Gin Arg Ala 
500 505 510 

Lys Ala Arg Trp Asp Pro Gin Asn lie Phe Gin His Gly Leu Ser lie 



515 



520 



525 
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Lys Pro Pro Ala Arg Leu Ser Pro Gly Gin Pro 
530 535 



<210> 17 
<211> 1194 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(1191) 

<223> ORF6; methyl transferase 

<400> 17 

atg tec aca acg cac gag ate gaa acc gtg gaa cgc ate ate etc gee 48 

Met Ser Thr Thr His Glu lie Glu Thr val Glu Arg lie lie Leu Ala 
1 5 10 15 

gee gga tec agt gcg gcg age ctg gee gac ctg acc acc gaa etc gga 96 
Ala Gly Ser Ser Ala Ala Ser Leu Ala Asp Leu Thr Thr Glu Leu Gly 
20 25 30 

etc gee agg ate gca ccc gtq ctg ate gac gag ate etc ttc cgc gcg 144 
Leu Ala Arg lie Ala Pro val Leu lie Asp Glu lie Leu Phe Arg Ala 
35 40 45 

gaa ccg gec ccc gac ate gaa egg acc gag gtc gcg gtc cag ate acc 192 
Glu Pro Ala Pro Asp lie Glu Arg Thr Glu Val Ala Val Gin lie Thr 
50 55 60 

cac cga ggc gag acc gtt gac ttc gtc ctg acg eta cag tec ggt gag 240 
His Arg Gly Glu Thr Val Asp Phe Val Leu Thr Leu Gin Ser Gly Glu 
65 70 75 80 

ctg ate aag gec gag caa cga ccg gtc gqa gac gtc ccg ctg egg ate 288 
Leu lie Lys Ala Glu Gin Arg Pro Val Gly Asp val Pro Leu Arg lie 
85 " 90 95 

ggt tac gag etc acc gat etc ate gec gag ttg ttc ggc cca gga get 336 
Gly Tyr Glu Leu Thr Asp Leu lie Ala Glu Leu Phe Gly Pro Gly Ala 
100 105 110 

ccc agg gee gtc ggc gee egg age acc aac ttc etc cga acc acc aca 384 
Pro Arg Ala val Gly Ala Arg Ser Thr Asn Phe Leu Arg Thr Thr Thr 
115 120 125 

tec ggt teg ata ccc ggt ccg teg gaa ctg tec gat ggc ttc cag gec 432 
Ser Gly Ser lie Pro Gly Pro Ser Glu Leu Ser Asp Gly Phe Gin Ala 
130 135 140 

ate tec gca gtg gtc gee ggc tgc ggg cac cga cgt ccc gac etc aac 480 
lie ser Ala Val Val Ala Gly cys Gly His Arg Arg Pro Asp Leu Asn 
145 150 155 160 

ttg etc gee tec cac tac cgc acg gac aag tgg ggc ggc ctg cac tgg 528 
Leu Leu Ala Ser His Tyr Arg Thr Asp Lys Trp Gly Gly Leu His Trp 
165 " 170 175 

ttc acc ccg eta tac gag cga cac etc ggc gag ttc cgt gat cgc ccg 576 
Phe Thr Pro Leu Tyr Glu Arg His Leu Gly Glu Phe Arg Asp Arg Pro 
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180 185 190 

gtq cgc ate ctg gag ate gqt gtc gqt gqc tac aac ttc gac gqt gqc 
val Arg lie Leu Glu lie Gly val Gly Gly Tyr Asn Phe Asp Gly Gly 
195 200 205 

gqc gqc gaa tec ctg aag atg tgg aag cgc tac ttc cac cgc gqc etc 
Gly Gly Glu ser Leu Lys Met Trp Lys Arg Tyr Phe His Arg Gly Leu 
210 215 220 



cac ate aac gqa cac gtq cgc aca tec ctg gaa acg ctg ttc ccc egg 
His lie Asn Gly His val Arg Thr Ser Leu Glu Thr Leu Phe Pro Arg 
275 280 285 

ttg cgc age gqt gqc gta tac gtq ate gag gat ctg tgg acg ace tat 
Leu Arg Ser Gly Gly val Tyr Val lie Glu Asp Leu Trp Thr Thr Tyr 
290 295 300 

get ccc gqa ttc gqc gqg cag gcg cag tgc ccg gee gca ccc gqc ace 
Ala Pro Gly Phe Gly Gly Gin Ala Gin cys Pro Ala Ala Pro Gly Thr 
305 310 315 320 



aac gee gaa gqc gqc gtt cct get tgg gtq cca agg agt ctg gac gac 
Asn Ala Glu Gly Gly val Pro Ala Trp Val Pro Arg ser Leu Asp Asp 
370 375 380 



624 



672 



gtq ttc gqg atg gac gtt ttc gac aag tec ttc etc gac cag cag agg 720 

val Phe Gly Met Asp val Phe Asp Lys Ser Phe Leu Asp Gin Gin Arg 
225 230 235 240 

etc tgc acc gtc cgc gee gac cag age aag ccc gag gag ctg gec gec 768 

Leu Cys Thr val Arg Ala Asp Gin Ser Lys Pro Glu Glu Leu Ala Ala 

245 250 255 

gtt gac gac aag tac gqa ccg ttc gac ate ate ate gac gat gqc age 816 

val Asp Asp Lys Tyr Gly Pro Phe Asp lie lie lie Asp Asp Gly Ser 

260 265 270 



864 



912 



960 



acg gtc age ctg etc aag aac ctg ttg gaa gqc gtt cag cac gag gag 1008 
Thr val Ser Leu Leu Lys Asn Leu Leu Glu Gly Val Gin His Glu Glu 
325 330 335 

cag ccg cat gcg gqc teg tac gag ccg age tac ctg gaa cgc aat ttg 1056 
Gin Pro His Ala Gly Ser Tyr Glu Pro Ser Tyr Leu Glu Arg Asn Leu 
340 * 345 350 

gtc gqc etc cac acc tac cac aac ate gcg ttc ctg gag aaa gqc gtc 1104 
val Gly Leu His Thr Tyr His Asn lie Ala Phe Leu Glu Lys Gly val 
355 360 365 



1152 



ata ttg cac ctg gec gac gtq aac age gcg gag gac gag tga 1194 
lie Leu His Leu Ala Asp Val Asn Ser Ala Glu Asp Glu 
385 390 395 

<210> 18 
<211> 397 
<212> PRT 

<213> saccharopolyspora spinosa 
<400> 18 

Met Ser Thr Thr His Glu lie Glu Thr val Glu Arg lie lie Leu Ala 
1 5 10 15 
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Ala Gly ser Ser Ala Ala Ser Leu Ala Asp Leu Thr Thr Glu Leu Gly 
20 25 30 

Leu Ala Arg lie Ala Pro Val Leu lie Asp Glu lie Leu Phe Arg Ala 
35 40 45 

Glu Pro Ala Pro Asp lie Glu Arg Thr Glu val Ala val Gin lie Thr 
50 55 60 

His Arg Gly Glu Thr Val Asp Phe Val Leu Thr Leu Gin Ser Gly Glu 
65 ~ 70 75 80 

Leu lie Lys Ala Glu Gin Arg Pro Val Gly Asp Val Pro Leu Arg He 
85 90 95 

Gly Tyr Glu Leu Thr Asp Leu lie Ala Glu Leu Phe Gly Pro Gly Ala 
100 105 110 

Pro Arg Ala Val Gly Ala Arg Ser Thr Asn Phe Leu Arg Thr Thr Thr 
115 120 125 

Ser Gly ser He Pro Gly Pro Ser Glu Leu Ser Asp Gly Phe Gin Ala 
130 135 140 

lie Ser Ala val val Ala Gly cys Gly His Arg Arg pro Asp Leu Asn 
145 150 155 160 

Leu Leu Ala Ser His Tyr Arg Thr Asp Lys Trp Gly Gly Leu His Trp 
165 170 175 

Phe Thr Pro Leu Tyr Glu Arg His Leu Gly Glu Phe Arg Asp Arg Pro 
180 185 190 

Val Arg lie Leu Glu lie Gly val Gly Gly Tyr Asn Phe Asp Gly Gly 
195 200 205 

Gly Gly Glu Ser Leu Lys Met Trp Lys Arg Tyr Phe His Arg Gly Leu 
210 215 220 

val Phe Gly Met Asp val Phe Asp Lys Ser Phe Leu Asp Gin Gin Arg 
225 230 ' 235 240 

Leu Cys Thr Val Arg Ala Asp Gin Ser Lys Pro Glu Glu Leu Ala Ala 
245 250 255 

Val Asp Asp Lys Tyr Gly Pro Phe Asp lie lie lie Asp Asp Gly Ser 
260 265 270 

His lie Asn Gly His val Arg Thr Ser Leu Glu Thr Leu Phe Pro Arg 
275 280 285 

Leu Arg Ser Gly Gly val Tyr val lie Glu Asp Leu Trp Thr Thr Tyr 
290 295 300 

Ala Pro Gly Phe Gly Gly Gin Ala Gin Cys Pro Ala Ala Pro Gly Thr 
305 310 315 320 

Thr val Ser Leu Leu Lys Asn Leu Leu Glu Gly val Gin His Glu Glu 
325 330 335 

Gin Pro His Ala Gly Ser Tyr Glu Pro Ser Tyr Leu Glu Arg Asn Leu 
340 345 350 
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val Gly Leu His Thr Tyr His Asn lie Ala Phe Leu Glu Lys Gly Val 
355 360 365 

Asn Ala Glu Gly Gly Val Pro Ala Trp Val Pro Arg Ser Leu Asp Asp 
370 375 380 

lie Leu His Leu Ala Asp Val Asn Ser Ala Glu Asp Glu 
385 390 395 



<210> 19 
<211> 900 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(897) 

<223> ORF7; O-methyl transferase 

<400> 19 

gtg aac age aga gqg gcg aac aca cag gca ttt ccg acc gcg gat cag 48 
val Asn Ser Arg Gly Ala Asn Thr Gin Ala Phe Pro Thr Ala Asp Gin 
1 5 10 15 

gtq gag tec ate ttc gat gcg ttg gcg cac gqg cgt ccc ctg cac cac 96 
Val Glu Ser lie Phe Asp Ala Leu Ala His Gly Arg Pro Leu His His 
20 25 30 

ggt tac tgg gcg ggc gqg tat egg gag gat gee ggt gee aca ccg tgg 144 
Gly Tyr Trp Ala Gly Gly Tyr Arg Glu Asp Ala Gly Ala Thr Pro Trp 
35 40 45 

teg gat get gec gac caa ctg acc gac ctg ttc ate gac aag gee gcg 192 
Ser Asp Ala Ala Asp Gin Leu Thr Asp Leu Phe lie Asp Lys Ala Ala 
50 55 60 

etc cgt ccc gga gcg cac ctg ttc gac ctg ggc tgc ggc aat ggg cag 240 
Leu Arg Pro Gly Ala His Leu Phe Asp Leu Gly Cys Gly Asn Gly Gin 
65 70 75 80 

ccc gta gtc cgt gcg gca tgc gec age ggc gtt cga gtc acc gqa ate 288 
Pro val val Arg Ala Ala cys Ala Ser Gly Val Arg Val Thr Gly lie 
85 90 ~ 95 

acc gtg aac gec cag cat etc gee gec gee acc agg etc gee aac gag 336 
Thr val Asn Ala Gin His Leu Ala Ala Ala Thr Arg Leu Ala Asn Glu 
100 105 110 

acc gga ctg gee ggc agt ctt gag ttc gat eta gtc gac ggc gee cag 384 
Thr Gly Leu Ala Gly Ser Leu Glu Phe Asp Leu Val Asp Gly Ala Gin 
115 120 125 

ctg ccc tac ccg gac ggt ttc ttt cag gee gca tgg gcg atg cag tec 432 
Leu Pro Tyr Pro Asp Gly Phe Phe Gin Ala Ala Trp Ala Met Gin Ser 
130 135 140 



gtc gtg cag ate gtg gac cag gec gec gcg ate cgc gag gtc cac cga 
val val Gin lie val Asp Gin Ala Ala Ala lie Arg Glu Val His Arg 
145 150 155 160 
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ate ctg gaa ccc gqc gqc egg ttc gtc etc gqa gac ate ate act egg 528 
lie Leu Glu Pro Gly Gly Arg Phe val Leu Gly Asp lie lie Thr Arg 
165 170 175 

gtt cga etc ccg gaa gag tac gcg gcg gtt tgg acg gqc acg acc gec 576 
val Arg Leu Pro Glu Glu Tyr Ala Ala Val Trp Thr Gly Thr Thr Ala 
180 185 190 

cat acc ttg aac age ttc acg gcg ctg gtc age gaa gec gqg ttc gag 624 
His Thr Leu Asn Ser Phe Thr Ala Leu Val Ser Glu Ala Gly Phe Glu 
195 200 205 

att etc gaa gtc acc gac etc acg gca cag acc agg tgc atg gtc tec 672 
lie Leu Glu val Thr Asp Leu Thr Ala Gin Thr Arg Cys Met val Ser 
210 215 220 

tgg tac gtc gac gag ttg etc egg aaa etc gat gag etc gee gqc gtc 720 
Trp Tyr val Asp Glu Leu Leu Arg Lys Leu Asp Glu Leu Ala Gly Val 
225 230 235 240 

gag cct gcg get gtc gqc acc tac cag caa cgc tac ttg gqa gac ate 768 
Glu Pro Ala Ala val Gly Thr Tyr Gin Gin Arg Tyr Leu Gly Asp lie 
245 250 255 

gcg gcg aag cac gqa ccg gqa cca gca cag ctg ate gec gcg gtt gcg 816 
Ala Ala Lys His Gly Pro Gly Pro Ala Gin Leu lie Ala Ala val Ala 
260 265 270 

gaa tac egg aaa cat ccg gat tac gec aga aac gag gaa age atg gqt 864 
Glu Tyr Arg Lys His Pro Asp Tyr Ala Arg Asn Glu Glu Ser Met Gly 
275 280 285 

ttc atg etc ctg cag get cga aag aag cag tec tga 900 
Phe Met Leu Leu Gin Ala Arg Lys Lys Gin Ser 
290 295 

<210> 20 
<211> 299 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 20 

val Asn Ser Arg Gly Ala Asn Thr Gin Ala Phe Pro Thr Ala Asp Gin 
1 5 10 15 

val Glu Ser lie Phe Asp Ala Leu Ala His Gly Arg Pro Leu His His 
20 25 30 

Gly Tyr Trp Ala Gly Gly Tyr Arg Glu Asp Ala Gly Ala Thr Pro Trp 
35 40 45 

Ser Asp Ala Ala Asp Gin Leu Thr Asp Leu Phe lie Asp Lys Ala Ala 
50 55 60 

Leu Arg Pro Gly Ala His Leu Phe Asp Leu Gly Cys Gly Asn Gly Gin 
65 70 75 80 

Pro Val val Arg Ala Ala Cys Ala Ser Gly Val Arg val Thr Gly lie 
85 90 ' 95 

Thr val Asn Ala Gin His Leu Ala Ala Ala Thr Arg Leu Ala Asn Glu 
100 105 " 110 
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Thr Gly Leu Ala Gly ser Leu Glu Phe Asp Leu Val Asp Gly Ala Gin 
115 120 125 

Leu Pro Tyr Pro Asp Gly Phe Phe Gin Ala Ala Trp Ala Met Gin Ser 
130 135 140 

Val val Gin lie val Asp Gin Ala Ala Ala lie Arg Glu Val His Arg 
145 150 155 160 

lie Leu Glu Pro Gly Gly Arg Phe val Leu Gly Asp lie lie Thr Arq 
165 " 170 175 

val Arg Leu Pro Glu Glu Tyr Ala Ala val Trp Thr Gly Thr Thr Ala 
180 185 190 

His Thr Leu Asn Ser Phe Thr Ala Leu val Ser Glu Ala Gly Phe Glu 
195 200 205 

lie Leu Glu Val Thr Asp Leu Thr Ala Gin Thr Arg Cys Met Val Ser 
210 215 220 

Trp Tyr val Asp Glu Leu Leu Arg Lys Leu Asp Glu Leu Ala Gly val 
225 230 235 240 

Glu Pro Ala Ala val Gly Thr Tyr Gin Gin Arg Tyr Leu Gly Asp lie 
245 250 255 

Ala Ala Lys His Gly Pro Gly Pro Ala Gin Leu lie Ala Ala Val Ala 
260 265 270 

Glu Tyr Arg Lys His Pro Asp Tyr Ala Arg Asn Glu Glu Ser Met Gly 
275 280 285 

Phe Met Leu Leu Gin Ala Arg Lys Lys Gin Ser 
290 295 



<210> 21 
<211> 1167 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (1164) 

<223> ORF8; cyclization enzyme 

<400> 21 

atg gcc tec gag cac gec age ctg gtc gqc gac gat ctg egg gca ccc 48 

Met Ala Ser Glu His Ala Ser Leu Val Gly Asp Asp Leu Arg Ala Pro 
15 10 15 

gcg gat gat ccc ttc tac cga ccg ccg acg ccg eta ccg ccg gqt gtc 96 
Ala Asp Asp Pro Phe Tyr Arg Pro Pro Thr Pro Leu Pro Pro Gly val 
20 25 30 

ccg gqc acg etc etc agg gcc egg ccc gtc teg gca ctg cgc gqc acg 144 
Pro Gly Thr Leu Leu Arg Ala Arg Pro Val Ser Ala Leu Arg Gly Thr 
35 40 45 

ggc gaa ccc gtc gca gcc aag gcc tgg caa ate etc tac egg tec aac 192 
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Gly Glu Pro val Ala Ala Lys Ala Trp Gin lie Leu Tyr Arg Ser Asn 
50 55 60 

tec gec ctt gqc atg ccg aac gec gtc tec gqc acc gtt ctg gtq ccg 

Ser Ala Leu Gly Met Pro Asn Ala Val Ser Gly Thr val Leu Val Pro 

65 70 75 80 



cga aat gac gqg gtq ttc acc gag ccg gtq ctg gec gqc ctg gtc gqg 
Arg Asn Asp Gly val Phe Thr Gly Pro val Leu Ala Gly Leu val Gly 
225 230 235 240 



240 



aac ate ccg tgg ccg cgc gaa gat cgc ccc ate ate act ttc gca gtq 288 

Asn lie Pro Trp Pro Arg Glu Asp Arg Pro lie lie Thr Phe Ala Val 
85 90 95 

gqc acc cac gqc etc gqt age caa gtt gec ccg teg tac ctg ctt cga 336 

Gly Thr His Gly Leu Gly Ser Gin val Ala Pro Ser Tyr Leu Leu Arg 
100 105 110 

acc gqa acc gag ccg gag acc gag ctg ate gee gtq gee etc gac cgc 384 

Thr Gly Thr Glu Pro Glu Thr Glu Leu lie Ala val Ala Leu Asp Arg 

115 120 125 

999 tgg gec gtq gtc ate acc gac tac gaa gqc etc gqt act cct gqa 432 

Gly Trp Ala Val Val lie Thr Asp Tyr Glu Gly Leu Gly Thr Pro Gly 

130 135 140 

acc cac acc tac acc gtc gqc agg gcg cag gqa cac gec atg etc gat 480 

Thr His Thr Tyr Thr val Gly Arg Ala Gin Gly His Ala Met Leu Asp 
145 150 155 160 

gee gee cgc get gcg caa egg eta ccg gqc tec gqc ctg acg acc gac 528 

Ala Ala Arg Ala Ala Gin Arg Leu Pro Gly Ser Gly Leu Thr Thr Asp 
165 170 175 

tgc ccg gtc gqc ate tgg gqc tat gcg cag gqt gqg caa gcg teg gec 576 

Cys Pro Val Gly lie Trp Gly Tyr Ala Gin Gly Gly Gin Ala Ser Ala 
180 185 190 

ttc gee gqc gaa ctg cac ccc acc tac gca cct gaa ctg cga ate cgc 624 

Phe Ala Gly Glu Leu His Pro Thr Tyr Ala Pro Glu Leu Arg lie Arg 

195 200 205 

get gcg gee gca gqt gcg gtq ccg ate gat ctg ctg gac ate ate cac 672 

Ala Ala Ala Ala Gly Ala Val Pro lie Asp Leu Leu Asp lie lie His 

210 215 220 



720 



cat gee get gee tac ccc gat ctg cca ttc gac gag ctt etc acc gaa 768 

His Ala Ala Ala Tyr Pro Asp Leu Pro Phe Asp Glu Leu Leu Thr Glu 
245 250 255 

gcg gqt cgt acc gec gtt gat caa gtq cgc gag etc gqt gca ccg gag 816 

Ala Gly Arg Thr Ala val Asp Gin val Arg Glu Leu Gly Ala Pro Glu 

260 265 ~ 270 

etc gtc acc cgc ttc etc gqc cgc gag ctg age gac ttc etc gac act 864 

Leu Val Thr Arg Phe Leu Gly Arg Glu Leu Ser Asp Phe Leu Asp Thr 

275 ~ 280 285 

tec gqc ctt ttc gag caa cct cga tgg cgc gca cga ctg gee gaa age 912 

Ser Gly Leu Phe Glu Gin Pro Arg Trp Arg Ala Arg Leu Ala Glu Ser 
290 295 ~ 300 
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gtc gca gqt agg aac gat gqc ccg gtq gtc ccc acg etc gtc tac cac 960 
val Ala Gly Arg Asn Gly Gly Pro val val Pro Thr Leu val Tyr His 
305 ~ 310 315 320 

age acg gac gac gag ate gtt ccg ttc gca ttc gqc gag cga etc egg 1008 
Ser Thr Asp Asp Glu lie val Pro Phe Ala Phe Gly Glu Arg Leu Arg 
325 330 335 

gac age tac cgc gcg gcg gqt acg cca gtq egg tgg cat ccg etc tec 1056 
Asp Ser Tyr Arg Ala Ala Gly Thr Pro Val Arg Trp His Pro Leu Ser 
340 345 " 350 

gqa ttg get cac ttt ccc gee gee ctg gee age teg cga gtq gtc gtc 1104 
Gly Leu Ala His Phe Pro Ala Ala Leu Ala Ser Ser Arg val val val 
355 360 365 

teg tgg ttc gac gag cac ttc tec gag ccg tec gcg ate age gqt ccg 1152 
Ser Trp Phe Asp Glu His Phe ser Glu Pro Ser Ala lie Ser Gly Pro 
370 375 380 

cga gat gee agg tga 1167 

Arg Asp Ala Arg 

385 

<210> 22 
<211> 388 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 22 

Met Ala ser Glu His Ala Ser Leu Val Gly Asp Asp Leu Arg Ala Pro 
1 5 10 15 

Ala Asp Asp Pro Phe Tyr Arg Pro Pro Thr Pro Leu Pro Pro Gly val 
20 25 30 

Pro Gly Thr Leu Leu Arg Ala Arg Pro Val Ser Ala Leu Arg Gly Thr 
35 40 45 

Gly Glu Pro Val Ala Ala Lys Ala Trp Gin lie Leu Tyr Arg Ser Asn 
50 55 60 

Ser Ala Leu Gly Met Pro Asn Ala val Ser Gly Thr val Leu val Pro 
65 70 75 80 

Asn lie Pro Trp Pro Arg Glu Asp Arg Pro lie lie Thr Phe Ala val 
85 90 95 

Gly Thr His Gly Leu Gly Ser Gin Val Ala Pro Ser Tyr Leu Leu Arg 
100 105 110 

Thr Gly Thr Glu Pro Glu Thr Glu Leu lie Ala Val Ala Leu Asp Arg 
115 120 125 

Gly Trp Ala val Val lie Thr Asp Tyr Glu Gly Leu Gly Thr Pro Gly 
130 135 140 

Thr His Thr Tyr Thr val Gly Arg Ala Gin Gly His Ala Met Leu Asp 
145 150 155 160 

Ala Ala Arg Ala Ala Gin Arg Leu Pro Gly Ser Gly Leu Thr Thr Asp 
165 170 175 
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Cys Pro val Gly lie Trp Gly Tyr Ala Gin Gly Gly Gin Ala Ser Ala 
180 185 190 

Phe Ala Gly Glu Leu His Pro Thr Tyr Ala Pro Glu Leu Arg lie Arg 
195 200 205 

Ala Ala Ala Ala Gly Ala val Pro lie Asp Leu Leu Asp lie lie His 
210 215 220 

Arg Asn Asp Gly val Phe Thr Gly Pro Val Leu Ala Gly Leu Val Gly 
225 230 * 235 240 

His Ala Ala Ala Tyr Pro Asp Leu Pro Phe Asp Glu Leu Leu Thr Glu 
245 250 255 

Ala Gly Arg Thr Ala Val Asp Gin val Arg Glu Leu Gly Ala Pro Glu 
260 265 ~ 270 

Leu Val Thr Arg Phe Leu Gly Arg Glu Leu Ser Asp Phe Leu Asp Thr 
275 280 285 

Ser Gly Leu Phe Glu Gin Pro Arg Trp Arg Ala Arg Leu Ala Glu Ser 
290 295 300 

Val Ala Gly Arg Asn Gly Gly Pro Val Val Pro Thr Leu val Tyr His 
305 310 315 320 

ser Thr Asp Asp Glu He Val Pro Phe Ala Phe Gly Glu Arg Leu Arg 
325 330 335 

Asp Ser Tyr Arg Ala Ala Gly Thr Pro val Arg Trp His Pro Leu Ser 
340 345 350 

Gly Leu Ala His Phe Pro Ala Ala Leu Ala Ser Ser Arg val val Val 
355 360 365 

Ser Trp Phe Asp Glu His Phe Ser Glu Pro Ser Ala lie Ser Gly Pro 
370 375 380 

Arg Asp Ala Arg 
385 



<210> 23 
<211> 1011 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(1008) 

<223> ORF9; 2,3-reductase 

<400> 23 

atg acc age teg atg cga aag ccg gtq cgc ate gqt gtq etc gqg tgc 48 
Met Thr Ser Ser Met Arg Lys Pro Val Arg lie Gly Val Leu Gly Cys 
1 5 10 15 

get tee tte geg tgg cga egg atg etg ccc gcg atg tgc gac gtq gec 96 

Ala Ser Phe Ala Trp Arg Arg Met Leu Pro Ala Met Cys Asp val Ala 
20 25 30 
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gaa aca gag gtq gtq gcg gtq gcg age cgt gat ccg gcg aaa gec gaa 144 

Glu Thr Glu val Val Ala val Ala Ser Arg Asp Pro Ala Lys Ala Glu 

35 40 45 

egg ttc gca gcg cga ttc gaa tgc gag gcg gtq ctg gqt tac cag egg 

Arg Phe Ala Ala Arg Phe Glu Cys Glu Ala Val Leu Gly Tyr Gin Arg 
50 55 60 



Leu val Gly Leu Ala Arg Arg Lys Asn Leu Leu Leu Arg Glu Asn Tyr 
115 120 125 

ctg ttc etc cac cac gqc egg cac gac gtq gtc cgc gac ctg ctg caa 
Leu Phe Leu His His Gly Arg His Asp Val val Arg Asp Leu Leu Gin 
130 135 140 



gqa gcg ttg ctg gac ate gqt gtc tat ccc gec cgt gec get egg 
Gly Ala Leu Leu Asp lie Gly val Tyr Pro Ala Arg Ala Ala Arg 
180 185 190 



ttt etc etc gqt ccg etc acg gtt etc gqc gca age teg cac gag gee 
Phe Leu Leu Gly Pro Leu Thr val Leu Gly Ala Ser Ser His Glu Ala 
195 200 205 



gcg ttc acg ccg ccc gec gag tgg cag gec gtq ate cga ate gag egg 
Ala Phe Thr Pro Pro Ala Glu Trp Gin Ala Val lie Arg lie Glu Arg 
260 265 270 

aag gqc gtt gtc gac gag ttg tec ttg cca gcg gaa gat cag gtt cgc 
Lys Gly val Val Asp Glu Leu Ser Leu Pro Ala Glu Asp Gin Val Arg 
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192 



etc ctg gag egg ccg gac ate gat gee gtc tac gtq ccg ttg ccg cct 240 
Leu Leu Glu Arg Pro Asp lie Asp Ala Val Tyr val Pro Leu Pro Pro 
65 70 75 80 

gqc atg cat gca gag tgg ate gqc aag gcg ctt gag gca gac aaa cac 288 
Gly Met His Ala Glu Trp lie Gly Lys Ala Leu Glu Ala Asp Lys His 
85 90 95 

gtq ctt gcg gag aaa ccg ctg acg acg acg gcg tec gac ace get cgc 336 
Val Leu Ala Glu Lys Pro Leu Thr Thr Thr Ala Ser Asp Thr Ala Arg 
100 105 110 

ctg gtc gqg ctg gee agg agg aag aac ctg ctg ctg egg gag aat tac 384 



432 



tec gqg gag ate gqt gag etc egg gag ttc ace gee gtq ttc gqa att 480 

Ser Gly Glu lie Gly Glu Leu Arg Glu Phe Thr Ala Val Phe Gly lie 

145 150 155 160 

ccg ccg ctt ccc gac acg gac ate cgc tat cgc acc gaa etc gqt gqc 528 

Pro Pro Leu Pro Asp Thr Asp lie Arg Tyr Arg Thr Glu Leu Gly Gly 

165 170 ~ 175 



cac 576 
His 



624 



cag gag teg gqc gtc gac ttg teg gqc age gtq ctg etc caa teg gaa 672 

Gin Glu Ser Gly val Asp Leu Ser Gly Ser val Leu Leu Gin Ser Glu 

210 215 220 

gqt gqc acc gtt gee cac etc gqa tac gqt ttc gtq cac cac tac cgc 720 

Gly Gly Thr Val Ala His Leu Gly Tyr Gly Phe val His His Tyr Arg 

225 230 235 240 

age gcg tac gag ctg tgg gqg agt cgt gqg cga ate gtc gtc gac egg 768 

Ser Ala Tyr Glu Leu Trp Gly Ser Arg Gly Arg lie Val Val Asp Arg 

245 250 255 



816 



864 



l_eA33955-US 
275 280 285 

aag gcg gtc acc gcc ttc gca cgc gac ate aga gca gqg aca ggc gtg 912 
Lys Ala val Thr Ala Phe Ala Arg Asp lie Arg Ala Gly Thr Gly val 
290 295 ~ 300 

gac gac cct gcg gtg gcc gga gat teg ggc gaa teg atg ate cag cag 960 
Asp Asp Pro Ala Val Ala Gly Asp Ser Gly Glu Ser Met lie Gin Gin 
305 310 315 320 

gcc gcg ctg gtg gag gcg ate ggt cag gcc cgt egg tgc ggg tec aca 1008 
Ala Ala Leu val Glu Ala lie Gly Gin Ala Arg Arg cys Gly Ser Thr 
325 330 " 335 

tag 1011 

<210> 24 
<211> 336 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 24 

Met Thr Ser Ser Met Arg Lys Pro Val Arg lie Gly Val Leu Gly Cys 
1 5 10 15 

Ala Ser Phe Ala Trp Arg Arg Met Leu Pro Ala Met cys Asp Val Ala 
20 ~ 25 30 

Glu Thr Glu val val Ala val Ala Ser Arg Asp Pro Ala Lys Ala Glu 
35 40 " 45 

Arg Phe Ala Ala Arg Phe Glu Cys Glu Ala Val Leu Gly Tyr Gin Arg 
50 55 60 

Leu Leu Glu Arg Pro Asp lie Asp Ala val Tyr val Pro Leu Pro Pro 
65 ^ 70 75 80 

Gly Met His Ala Glu Trp lie Gly Lys Ala Leu Glu Ala Asp Lys His 
85 90 95 

val Leu Ala Glu Lys Pro Leu Thr Thr Thr Ala Ser Asp Thr Ala Arg 
100 105 110 

Leu Val Gly Leu Ala Arg Arg Lys Asn Leu Leu Leu Arg Glu Asn Tyr 
115 120 125 

Leu Phe Leu His His Gly Arg His Asp Val Val Arg Asp Leu Leu Gin 
130 135 140 

Ser Gly Glu lie Gly Glu Leu Arg Glu Phe Thr Ala Val Phe Gly lie 
145 150 155 160 

Pro Pro Leu Pro Asp Thr Asp lie Arg Tyr Arg Thr Glu Leu Gly Gly 
165 170 ~ 175 

Gly Ala Leu Leu Asp lie Gly val Tyr Pro Ala Arg Ala Ala Arg His 
180 185 ^ 190 

Phe Leu Leu Gly Pro Leu Thr val Leu Gly Ala Ser Ser His Glu Ala 
195 200 205 

Gin Glu Ser Gly val Asp Leu Ser Gly Ser val Leu Leu Gin Ser Glu 
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210 215 220 

Gly Gly Thr Val Ala His Leu Gly Tyr Gly Phe Val His His Tyr Arg 

225 230 235 240 

Ser Ala Tyr Glu Leu Trp Gly Ser Arg Gly Arg lie val val Asp Arg 

245 250 255 

Ala Phe Thr Pro Pro Ala Glu Trp Gin Ala val lie Arg lie Glu Arg 

260 265 270 

Lys Gly val val Asp Glu Leu Ser Leu Pro Ala Glu Asp Gin Val Arg 

275 280 285 

Lys Ala val Thr Ala Phe Ala Arg Asp lie Arg Ala Gly Thr Gly val 

290 295 " 300 

Asp Asp Pro Ala val Ala Gly Asp Ser Gly Glu Ser Met lie Gin Gin 

305 310 315 320 

Ala Ala Leu val Glu Ala lie Gly Gin Ala Arg Arg Cys Gly Ser Thr 

325 330 ~ 335 



<210> 25 
<211> 1461 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(1458) 

<223> ORF10; 2,3-dehydratase 

<400> 25 

atg age agt tct gtc gaa get gag gca agt get get gcg ccg etc gqc 48 

Met Ser Ser Ser val Glu Ala Glu Ala Ser Ala Ala Ala Pro Leu Gly 
15 10 15 

age aac aac acg egg egg ttc gtc gac tct gcg ctg age get tgc aat 96 
Ser Asn Asn Thr Arg Arg Phe val Asp Ser Ala Leu Ser Ala Cys Asn 
20 ~ 25 30 

gqc atg att ccg acc acg gag ttc cac tgc tgg etc gee gat egg ctg 144 
Gly Met lie Pro Thr Thr Glu Phe His Cys Trp Leu Ala Asp Arg Leu 
35 40 45 

gqc gag aac age ttc gag acc aat cgc ate ccg ttc gac cgc ctg teg 192 
Gly Glu Asn Ser Phe Glu Thr Asn Arg lie Pro Phe Asp Arg Leu Ser 
50 55 60 

aaa tgg aaa ttc gat gec age acg gag aac ctg gtt cat gec gac gqt 240 
Lys Trp Lys Phe Asp Ala Ser Thr Glu Asn Leu Val His Ala Asp Gly 
65 70 75 80 

agg ttc ttc acg gta gaa gqc ctg cag gtc gag acc aac tat gqc gcg 288 
Arg Phe Phe Thr val Glu Gly Leu Gin val Glu Thr Asn Tyr Gly Ala 
85 90 95 

gca ccc age tgg cac cag ccg ate ate aac cag get gaa gta gqt ate 336 
Ala Pro Ser Trp His Gin Pro lie lie Asn Gin Ala Glu Val Gly lie 
100 105 110 
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etc gqc att etc gtc aag gag ate gac gqc gtq ctg cac tgc etc atg 384 
Leu Gly lie Leu val Lys Glu lie Asp Gly val Leu His cys Leu Met 
115 120 125 

tea gca aag atg gaa ccg gqc aac gtc aac gtc ctg cag etc teg ccg 432 
Ser Ala Lys Met Glu Pro Gly Asn Val Asn Val Leu Gin Leu Ser Pro 
130 135 140 

acg gtt cag gca act egg age aac tac acg cag gca cac cgt g^c age 480 



Thr val Gin Ala Thr Arg Ser Asn Tyr Thr Gin Ala His Arg Gly Ser 

145 150 155 " 160 

gtt ccg ccc tat gtq gac tac ttc etc gqg egg gqc cgc gqc cgc gtq 

val Pro Pro Tyr val Asp Tyr Phe Leu Gly Arg Gly Arg Gly Arg val 



acg cag ccg atg ate gaa ccg cga gaa caa gqt gag ate gca ctg ttg 
Thr Gin Pro Met lie Glu Pro Arg Glu Gin Gly Glu lie Ala Leu Leu 
340 345 350 



528 



ctg gta gac gtq etc cag tct gaa cag gqg tec tgg ttc tac egg aag 576 

Leu val Asp val Leu Gin Ser Glu Gin Gly ser Trp Phe Tyr Arg Lys 

180 185 190 

cgc aac egg aac atg gtq gtq gaa gtc cag gag gaa gtq cca gtc ctg 624 

Arg Asn Arg Asn Met val val Glu val Gin Glu Glu val Pro val Leu 

195 200 205 

cca gac ttc tgc tgg ttg acg etc gqc cag gtq ctg get etc ctt cgt 672 

Pro Asp Phe Cys Trp Leu Thr Leu Gly Gin Val Leu Ala Leu Leu Arg 

210 215 220 

cag gac aac ate gtc aac atg gac acc egg acg gtq ctg tct tgc ate 720 

Gin Asp Asn lie val Asn Met Asp Thr Arg Thr val Leu ser Cys lie 

225 230 235 240 

ccg ttc cac gat tec gee acc gqa ccc gaa eta gee gee teg gag gag 768 

Pro Phe His Asp Ser Ala Thr Gly Pro Glu Leu Ala Ala Ser Glu Glu 
245 250 255 

ccc ttc cga cag gcg gtq gee agg teg etc teg cac gqc ate gat teg 816 

Pro Phe Arg Gin Ala val Ala Arg Ser Leu Ser His Gly lie Asp Ser 

260 265 270 

teg agt ate tec gag gcg gtc gqt tgg ttc gag gaa gee aag gee cgc 864 

Ser ser lie Ser Glu Ala val Gly Trp Phe Glu Glu Ala Lys Ala Arg 

275 280 285 

tac cgc ttg egg gca acg cgc gtt ccg ctg age agg gtc gac aag tgg 912 

Tyr Arg Leu Arg Ala Thr Arg val Pro Leu Ser Arg val Asp Lys Trp 

290 295 300 

tat cgc acc gat acc gag ate gee cac cag gac gqc aag tac ttc gcg 960 

Tyr Arg Thr Asp Thr Glu lie Ala His Gin Asp Gly Lys Tyr Phe Ala 

305 310 315 320 

gtq ate gcg gtq teg gtq tec gcg acc aat cgt gag gtc gee age tgg 1008 

Val lie Ala val Ser Val Ser Ala Thr Asn Arg Glu Val Ala Ser Trp 
325 330 335 



1056 



gtc aag egg ate gqc gqa gtq ctg cac gqt ttg gtc cac get egg gtq 1104 
val Lys Arg lie Gly Gly Val Leu His Gly Leu Val His Ala Arg Val 
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355 360 365 

gag get ggg tat aag tgg act gcg gaa ate get ccc acg gtc cag tgc 1152 

Glu Ala Gly Tyr Lys Trp Thr Ala Glu lie Ala Pro Thr val Gin Cys 
370 375 380 

agt gtg gec aac tac caa age acc ccg teg aac gac tgg ccg ccg ttc 1200 

Ser val Ala Asn Tyr Gin ser Thr Pro Ser Asn Asp Trp Pro Pro Phe 
385 390 395 400 



ttg gac gac gtg etc acc gee gat ccc gaa acc gtg egg tac gaa teg 
Leu Asp Asp val Leu Thr Ala Asp Pro Glu Thr Val Arg Tyr Glu Ser 
405 410 415 



1248 



ate ctg tec gaa gaa ggc ggt egg ttc tac cag gcg cag aac agg tac 1296 
lie Leu ser Glu Glu Gly Gly Arg Phe Tyr Gin Ala Gin Asn Arg Tyr 
420 425 430 

egg ate ate gag gtg cat gag gac ttc gcg gca cga cct ccc age gac 1344 
Arg He lie Glu val His Glu Asp Phe Ala Ala Arg Pro Pro Ser Asp 
435 440 445 

ttc egg tgg atg act ttg gga cag ttg ggc gag ctg etc egg age acc 1392 
Phe Arg Trp Met Thr Leu Gly Gin Leu Gly Glu Leu Leu Arg Ser Thr 
450 455 460 

cac ttc ttg aac ate cag gcg cgc age ttg gtc gec tec ctg cat age 1440 
His Phe Leu Asn lie Gin Ala Arg Ser Leu val Ala Ser Leu His Ser 
465 470 475 480 

ttg tgg gcg ttg ggg cga tga 1461 
Leu Trp Ala Leu Gly Arg 
485 

<210> 26 
<211> 486 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 26 

Met ser ser ser val Glu Ala Glu Ala Ser Ala Ala Ala Pro Leu Gly 
15 10 15 

Ser Asn Asn Thr Arg Arg Phe Val Asp Ser Ala Leu Ser Ala Cys Asn 
20 25 30 

Gly Met lie Pro Thr Thr Glu Phe His Cys Trp Leu Ala Asp Arg Leu 
35 40 45 

Gly Glu Asn Ser Phe Glu Thr Asn Arg lie Pro Phe Asp Arg Leu Ser 
50 55 60 

Lys Trp Lys Phe Asp Ala Ser Thr Glu Asn Leu Val His Ala Asp Gly 
65 70 75 80 

Arg Phe Phe Thr val Glu Gly Leu Gin val Glu Thr Asn Tyr Gly Ala 
85 90 95 

Ala Pro ser Trp His Gin Pro lie He Asn Gin Ala Glu val Gly lie 
100 105 110 

Leu Gly lie Leu val Lys Glu lie Asp Gly val Leu His Cys Leu Met 
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115 120 125 

Ser Ala Lys Met Glu Pro Gly Asn val Asn val Leu Gin Leu Ser Pro 
130 135 140 

Thr val Gin Ala Thr Arg Ser Asn Tyr Thr Gin Ala His Arg Gly Ser 
145 150 155 " 160 

Val Pro Pro Tyr val Asp Tyr Phe Leu Gly Arg Gly Arg Gly Arg Val 
165 170 ~ 175 

Leu Val Asp Val Leu Gin Ser Glu Gin Gly Ser Trp Phe Tyr Arg Lys 
180 185 190 

Arg Asn Arg Asn Met val val Glu val Gin Glu Glu val Pro val Leu 
195 200 205 

Pro Asp Phe Cys Trp Leu Thr Leu Gly Gin Val Leu Ala Leu Leu Arg 
210 215 220 

Gin Asp Asn lie val Asn Met Asp Thr Arg Thr Val Leu Ser Cys lie 
225 230 235 240 

Pro Phe His Asp Ser Ala Thr Gly Pro Glu Leu Ala Ala ser Glu Glu 
245 250 255 

Pro Phe Arg Gin Ala val Ala Arg Ser Leu Ser His Gly lie Asp Ser 
260 265 270 

Ser ser lie Ser Glu Ala val Gly Trp Phe Glu Glu Ala Lys Ala Arg 
275 280 285 

Tyr Arg Leu Arg Ala Thr Arg val Pro Leu Ser Arg val Asp Lys Trp 
290 " 295 300 

Tyr Arg Thr Asp Thr Glu lie Ala His Gin Asp Gly Lys Tyr Phe Ala 
305 ~ 310 315 320 

val lie Ala val Ser val Ser Ala Thr Asn Arg Glu val Ala Ser Trp 
325 330 ~ 335 

Thr Gin Pro Met lie Glu Pro Arg Glu Gin Gly Glu lie Ala Leu Leu 
340 345 350 

Val Lys Arg lie Gly Gly Val Leu His Gly Leu Val His Ala Arg Val 
355 360 365 

Glu Ala Gly Tyr Lys Trp Thr Ala Glu lie Ala Pro Thr val Gin Cys 
370 375 380 

Ser val Ala Asn Tyr Gin Ser Thr Pro ser Asn Asp Trp Pro Pro Phe 
385 390 395 400 

Leu Asp Asp val Leu Thr Ala Asp Pro Glu Thr val Arg Tyr Glu Ser 
405 410 415 

lie Leu Ser Glu Glu Gly Gly Arg Phe Tyr Gin Ala Gin Asn Arg Tyr 
420 425 430 

Arg lie lie Glu val His Glu Asp Phe Ala Ala Arg Pro Pro Ser Asp 
435 440 445 

Phe Arg Trp Met Thr Leu Gly Gin Leu Gly Glu Leu Leu Arg Ser Thr 

Page 97 



LeA33955-US 

450 455 460 

His Phe Leu Asn lie Gin Ala Arg Ser Leu Val Ala Ser Leu His Ser 
465 470 475 480 

Leu Trp Ala Leu Gly Arg 
485 



<210> 27 
<211> 524 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(438) 

<223> ORFll; thioesterase 

<400> 27 

gtq age aac gtg tgg ccg gaa aca tgg acg ccg ggg ttt gqc agg tgt 
val Ser Asn Val Trp Pro Glu Thr Trp Thr Pro Gly Phe Gly Arg cys 
1 5 10 15 



gcg cca gga gcg aag gtq gqt tgc ccg ate acc gtq ctg gtq gta gat 
Ala Pro Gly Ala Lys Val Gly Cys Pro lie Thr Val Leu val Val Asp 
85 * 90 95 



48 



tea teg ctg ttg cgt cga etc gga ttc cgc cgt gac egg gac gat gee 96 
Ser Ser Leu Leu Arg Arg Leu Gly Phe Arg Arg Asp Arg Asp Asp Ala 
20 25 30 

agg cga gtc ccg aag tea gat tct tgt cca gaa teg tec aat gqg gtq 144 
Arg Arg val Pro Lys Ser Asp Ser cys Pro Glu Ser Ser Asn Gly val 
35 40 45 

ttg ate tec cca gag gtt tgc get cca acc gat ttc cga cga gga teg 192 
Leu lie Ser Pro Glu val Cys Ala Pro Thr Asp Phe Arg Arg Gly Ser 
50 55 60 

tgg cgc ccg ctg age aac gac tac cgt gcg gtc gag aca tac cgc tgt 240 
Trp Arg Pro Leu Ser Asn Asp Tyr Arg Ala val Glu Thr Tyr Arg Cys 
65 70 75 80 



288 



gec gag ccg aag gtc acc ttg gat gag gcg gaa gec tgg cga gag cac 336 
Ala Glu Pro Lys Val Thr Leu Asp Glu Ala Glu Ala Trp Arg Glu His 
100 105 110 

acc gag gec gtg gec gac gtc cgt gtc ttc tec ggc gqg cat ttc ttc 384 
Thr Glu Ala Val Ala Asp Val Arg Val Phe Ser Gly Gly His Phe Phe 
115 120 125 

atg acc gaa cgc cag gac gag gtg etc gcg gtc ctt acg gqc gga teg 432 
Met Thr Glu Arg Gin Asp Glu Val Leu Ala val Leu Thr Gly Gly Ser 
130 135 140 

ctt cga tgatcctcgc caggccgctg gaccagaccg cgacgcccct gggagcegge 488 

Leu Arg 

145 

gtgeacateg teaeggcagt gagggattgg gcatga 524 
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Trp Thr 
10 

Phe Arg 

25 

Cys Pro 

Pro Thr 

Arg Ala 

Pro lie 
90 

Glu Ala 
105 

val Phe 
Leu Ala 



Pro Gly Phe 

Arg Asp Arg 

Glu Ser ser 
45 

Asp Phe Arg 
60 

val Glu Thr 
75 

Thr val Leu 
Glu Ala Trp 



Ser Gly Gly 
125 

val Leu Thr 
140 



Gly Arg Cys 
15 

Asp Asp Ala 
30 

Asn Gly val 

Arg Gly Ser 

Tyr Arg Cys 
80 

val val Asp 
95 

Arg Glu His 
110 

His Phe Phe 
Gly Gly Ser 



<210> 28 
<211> 146 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 28 

val Ser Asn Val Trp Pro Glu Thr 
1 5 

ser ser Leu Leu Arg Arg Leu Gly 
20 

Arg Arg val Pro Lys ser Asp Ser 
35 40 

Leu lie Ser Pro Glu Val Cys Ala 
50 55 

Trp Arg Pro Leu Ser Asn Asp Tyr 
65 70 

Ala Pro Gly Ala Lys Val Gly Cys 



Ala Glu Pro Lys Val Thr Leu Asp 
100 

Thr Glu Ala val Ala Asp val Arg 
115 120 

Met Thr Glu Arg Gin Asp Glu Val 
130 ~ 135 

Leu Arg 
145 



<210> 29 
<211> 1320 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 
<221> CDS 
<222> (1)..(1317) 
<223> ORF12; glycosyl transferase 

<400> 29 

atg cgt gtc ctg ttc ace ccg ctg ccg 
Met Arg Val Leu Phe Thr Pro Leu Pro 
1 ~ 5 

ctg gtq ccg ttg gcg tgg gcg ttg cgt 
Leu val Pro Leu Ala Trp Ala Leu Arg 
20 25 

gtc gcc ate tgc ccg aat atg gtq teg 
val Ala lie Cys Pro Asn Met Val Ser 
35 40 

acc gcg gtt ccc gtc ggc gac gag etc 



gcg agt teg cac ttc ttc aac 48 
Ala ser Ser His Phe Phe Asn 
10 15 

gcc gcg gqg cac gag gtc cgt 96 
Ala Ala Gly His Glu val Arg 
30 

atg gtc acc gqa gca gqa etc 144 
Met val Thr Gly Ala Gly Leu 
45 

gac etc ate tec ttg gcg gcc 192 
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Thr Ala Val Pro val Gly Asp Glu Leu Asp Leu lie Ser Leu Ala Ala 
50 55 60 

aag aac gaa etc gtt etc ggc age ggg gtc teg ttc gac gag aag ggg 240 
Lys Asn Glu Leu Val Leu Gly Ser Gly val Ser Phe Asp Glu Lys Gly 
65 70 75 80 

egg cat ccg gaa etc ttc gac gag ctg ctg tea ate aac tec ggc aga 288 
Arg His Pro Glu Leu Phe Asp Glu Leu Leu ser lie Asn Ser Gly Arg 
85 90 95 

gac acg gac gec gtg gag caa etc cac ctt gtg gat gac cga teg ctg 
Asp Thr Asp Ala val Glu Gin Leu His Leu Val Asp Asp Arg Ser Leu 
100 105 110 

gac gat etc atg ggg ttc gee gag aaa tgg cag cct gat etc gtt gtg 
Asp Asp Leu Met Gly Phe Ala Glu Lys Trp Gin Pro Asp Leu Val val 
115 120 125 



gca cga cac gtg egg atg etc gtc gee etc gat gtg teg ggg tgg ctg 
Ala Arg His val Arg Met Leu Val Ala Leu Asp Val ser Gly Trp Leu 
145 150 155 160 



gtc gac ccg etc ggg acg tgg ctg gga gcg aag etc gee aag ttc gga 

val Asp Pro Leu Gly Thr Trp Leu Gly Ala Lys Leu Ala Lys Phe Gly 

180 185 190 

gec acg ttc gat gaa gag ate gtg acg ggc caa gcg acc ata gat ccg 

Ala Thr Phe Asp Glu Glu lie Val Thr Gly Gin Ala Thr lie Asp Pro 
195 200 205 



336 



384 



tgg gac get atg gtg tgt teg ggg cca gtt gtg gcg cga gcg etc ggc 432 
Trp Asp Ala Met val Cys Ser Gly Pro Val val Ala Arg Ala Leu Gly 
130 135 140 



480 



egg tec ggt ttc etc gaa tac cag gaa teg aag ccg cct gag cag cgc 528 
Arg Ser Gly Phe Leu Glu Tyr Gin Glu Ser Lys Pro Pro Glu Gin Arg 
165 170 175 



576 



624 



att cca tec tgg atg cgc ctg cct gtg gac ttg gac tac ate teg atg 672 
lie Pro ser Trp Met Arg Leu Pro val Asp Leu Asp Tyr lie Ser Met 
210 215 220 

cgt ttc gtg ccg tac aac ggt ccg gcg gtg ttg ccg gag tgg ttg cgc 720 
Arg Phe val Pro Tyr Asn Gly Pro Ala val Leu Pro Glu Trp Leu Arg 
225 230 235 240 

gaa cga ccg acg aag ccg cgc gtc tgc ate acg cgc ggg ctg acc aag 
Glu Arg Pro Thr Lys Pro Arg Val Cys lie Thr Arg Gly Leu Thr Lys 
245 250 ~ 255 



768 



egg egg ctg age agg gtg acc gaa cag tac ggg gag caa agt gac cag 816 

Arg Arg Leu Ser Arg Val Thr Glu Gin Tyr Gly Glu Gin Ser Asp Gin 
260 265 270 

gaa caa gca atg gtg gaa agg ttg ttg cgc ggc gcg gec agg etc gac 864 

Glu Gin Ala Met val Glu Arg Leu Leu Arg Gly Ala Ala Arg Leu Asp 
275 280 285 

gtc gag gtg ate gec acc ttg tct gac gac gaa gta egg gag atg ggg 912 

val Glu val lie Ala Thr Leu Ser Asp Asp Glu val Arg Glu Met Gly 
290 295 300 
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gag ttg ccc teg aac gtc egg gtc cac gaa tac gta ccg etc aac gaa 960 
Glu Leu Pro Ser Asn Val Arg val His Glu Tyr Val Pro Leu Asn Glu 
305 310 ~ 315 320 

ctg ctg gag teg tgt tea gtq ate ate cat cat gqc teg acg acg acg 1008 
Leu Leu Glu Ser Cys Ser Val lie lie His His Gly Ser Thr Thr Thr 
325 330 335 

cag gaa acc gec acg gtc aac gqc gta ccg cag ttg att etc cct gqg 
Gin Glu Thr Ala Thr Val Asn Gly Val Pro Gin Leu lie Leu Pro Gly 
340 345 350 



ly Leu val Leu Asp Pro Ala Thr Phe Thr Glu Asp Asp Val Arg 
370 375 380 

gqt cag ctg gee cgc ctg etc gac gag ccg teg ttc get gee aac gcg 

Gly Gin Leu Ala Arg Leu Leu Asp Glu Pro Ser Phe Ala Ala Asn Ala 

385 390 395 400 



1056 



acc ttc tgg gac gaa tct cgt agg gcg gag etc eta gee gat egg gqa 1104 

Thr Phe Trp Asp Glu Ser Arg Arg Ala Glu Leu Leu Ala Asp Arg Gly 
355 360 365 

gee gqt ctg gtc etc gac ccc gcg acg ttt acc gaa gac gac gtq cga 1152 

Ala Gly Leu val Leu Asp Pro Ala Thr Phe Thr Glu Asp Asp val 



1200 



gcg ctg ate cgc cgt gaa ate gag gaa agt ccc age ccg cac gac ate 1248 
Ala Leu lie Arg Arg Glu lie Glu Glu Ser Pro Ser Pro His Asp lie 
405 410 415 

gtt cca cgt ctg gaa aag eta gtt gee gaa cgt gag aac cgc cgc act 1296 
val Pro Arg Leu Glu Lys Leu Val Ala Glu Arg Glu Asn Arg Arg Thr 
420 425 430 

gqg cag tct gat gqc cat ccg tga 1320 
Gly Gin Ser Asp Gly His Pro 
435 

<210> 30 
<211> 439 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 30 

Met Arg val Leu Phe Thr Pro Leu Pro Ala Ser Ser His Phe Phe Asn 
1 5 10 15 

Leu val Pro Leu Ala Trp Ala Leu Arg Ala Ala Gly His Glu Val Arg 
20 25 30 

val Ala lie Cys Pro Asn Met val Ser Met val Thr Gly Ala Gly Leu 
35 40 45 

Thr Ala val Pro Val Gly Asp Glu Leu Asp Leu lie Ser Leu Ala Ala 
50 55 60 

Lys Asn Glu Leu val Leu Gly Ser Gly val Ser Phe Asp Glu Lys Gly 
65 70 75 80 

Arg His Pro Glu Leu Phe Asp Glu Leu Leu Ser lie Asn Ser Gly Arg 
85 90 95 

Asp Thr Asp Ala val Glu Gin Leu His Leu Val Asp Asp Arg Ser Leu 
100 105 110 
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Asp Asp Leu Met Gly Phe Ala Glu Lys Trp Gin Pro Asp Leu val val 
115 120 125 

Trp Asp Ala Met val cys Ser Gly Pro val val Ala Arg Ala Leu Gly 
130 135 140 

Ala Arg His val Arg Met Leu val Ala Leu Asp val Ser Gly Trp Leu 
145 150 155 160 

Arg Ser Gly Phe Leu Glu Tyr Gin Glu Ser Lys Pro Pro Glu Gin Arg 
165 170 175 

val Asp Pro Leu Gly Thr Trp Leu Gly Ala Lys Leu Ala Lys Phe Gly 
180 185 ' 190 

Ala Thr Phe Asp Glu Glu lie val Thr Gly Gin Ala Thr lie Asp Pro 
195 200 205 

lie Pro Ser Trp Met Arg Leu Pro Val Asp Leu Asp Tyr lie Ser Met 
210 215 220 

Arg Phe Val Pro Tyr Asn Gly Pro Ala Val Leu Pro Glu Trp Leu Arg 
225 230 235 240 

Glu Arg Pro Thr Lys Pro Arg val Cys lie Thr Arg Gly Leu Thr Lys 
245 ~ 250 " 255 

Arg Arg Leu Ser Arg val Thr Glu Gin Tyr Gly Glu Gin Ser Asp Gin 
260 265 270 

Glu Gin Ala Met val Glu Arg Leu Leu Arg Gly Ala Ala Arg Leu Asp 
275 280 " ' 285 ' 

val Glu val lie Ala Thr Leu ser Asp Asp Glu val Arg Glu Met Gly 
290 295 300 

Glu Leu Pro Ser Asn val Arg val His Glu Tyr val Pro Leu Asn Glu 
305 310 315 320 

Leu Leu Glu Ser Cys Ser val lie lie His His Gly Ser Thr Thr Thr 
325 330 335 

Gin Glu Thr Ala Thr val Asn Gly Val Pro Gin Leu lie Leu Pro Gly 
340 345 350 

Thr Phe Trp Asp Glu Ser Arg Arg Ala Glu Leu Leu Ala Asp Arg Gly 
355 360 365 

Ala Gly Leu Val Leu Asp Pro Ala Thr Phe Thr Glu Asp Asp val Arg 
370 375 380 

Gly Gin Leu Ala Arg Leu Leu Asp Glu Pro Ser Phe Ala Ala Asn Ala 
385 390 395 400 

Ala Leu lie Arg Arg Glu lie Glu Glu Ser Pro Ser Pro His Asp lie 
405 410 415 

Val Pro Arg Leu Glu Lys Leu Val Ala Glu Arg Glu Asn Arg Arg Thr 
420 425 430 

Gly Gin ser Asp Gly His Pro 
435 
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<210> 31 
<211> 1389 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (1386) 

<223> ORF13; 3,4-dehydratase 

<400> 31 

atg cag age egg aaa acc aga gcg ctg gqg aaa gqg cgc gec aga gtq 48 

Met Gin Ser Arg Lys Thr Arg Ala Leu Gly Lys Gly Arg Ala Arg val 
1 5 10 15 

act teg tgt gac gac act tgc get acc get act gag atg gtq ccg gat 96 
Thr Ser cys Asp Asp Thr Cys Ala Thr Ala Thr Glu Met val Pro Asp 
20 25 30 

gee aag gac egg ata ttg gca tec gta cgc gat tac cac cgc gaa cag 144 
Ala Lys Asp Arg lie Leu Ala Ser val Arg Asp Tyr His Arg Glu Gin 
35 40 ~ 45 

gaa tec ccg acc ttc gtq get gqa teg acg ccg ate egg cca teg gqc 192 
Glu Ser Pro Thr Phe Val Ala Gly ser Thr Pro lie Arg Pro Ser Gly 
50 55 60 

gee gtq etc gac gag gac gac egg gtq gca ctg gtq gaa gee gcg ctg 240 
Ala val Leu Asp Glu Asp Asp Arg val Ala Leu val Glu Ala Ala Leu 
65 70 ~ 75 80 

gag etc egg ate gee gcg gqc gqg aat gca egg cga ttc gag age gag 288 
Glu Leu Arg lie Ala Ala Gly Gly Asn Ala Arg Arg Phe Glu Ser Glu 
85 90 95 

ttc gee cgc ttc ttc gqc etc cgc aag get cat etc gtc aac tec ggt 336 
Phe Ala Arg Phe Phe Gly Leu Arg Lys Ala His Leu val Asn Ser Gly 
100 105 110 

teg teg gee aat etc ctg gca ctg agt teg ctt acc tec ccc aaa etc 384 
Ser ser Ala Asn Leu Leu Ala Leu Ser Ser Leu Thr Ser Pro Lys Leu 
115 120 125 

gqc gag gca cga ctg egg ccc gqc gac gaa gtq ate act gcg gcg gtc 432 
Gly Glu Ala Arg Leu Arg Pro Gly Asp Glu val lie Thr Ala Ala val 
130 135 140 

gqc ttc ccc acg acg ate aat ccg gcg gtc caa aac gqa etc gtc ccg 480 
Gly Phe Pro Thr Thr lie Asn Pro Ala Val Gin Asn Gly Leu Val Pro 
145 150 155 ' 160 

gta ttc gtc gac gtq gaa ctg gqc acc tac aac gca acg cca gac cgc 528 
val Phe val Asp val Glu Leu Gly Thr Tyr Asn Ala Thr Pro Asp Arg 
165 170 175 

ate aag gee gee gtc acg gaa egg acg cga gee ate atg ctg gcg cac 576 
lie Lys Ala Ala val Thr Glu Arg Thr Arg Ala lie Met Leu Ala His 
180 185 190 

acc ctg ggc aac ccc ttc gee get gac gaa ate gcg gag ate gca aaa 624 
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Thr Leu Gly Asn Pro Phe Ala Ala Asp Glu lie Ala Glu lie Ala Lys 
195 200 205 

gaa cac gag ctg ttc etc gtc gaa gac aac tgt gat gcg gtg gga tec 
Glu His Glu Leu Phe Leu Val Glu Asp Asn Cys Asp Ala Val Gly Ser 
210 215 220 



age ttc tat cct gec cat cac ate ace age ggc gag ggt ggc tgc gtg 
Ser Phe Tyr Pro Ala His His lie Thr Ser Gly Glu Gly Gly cys Val 
245 250 255 



tgg gga egg gat tgc tgg tgc gag ccc gqc gtg gac aac acc tgc cgc 
Trp Gly Arg Asp Cys Trp Cys Glu Pro Gly Val Asp Asn Thr Cys Arg 
275 280 285 



aag tac acg ttc tec cac gtc ggt tac aac etc aag acc acc gac ctg 
Lys Tyr Thr Phe Ser His val Gly Tyr Asn Leu Lys Thr Thr Asp Leu 
305 310 315 320 



ctg ttc ggc ggt aac ate acc egg cac ccg gec ttc gag cag gtg egg 
Leu Phe Gly Gly Asn lie Thr Arg His Pro Ala Phe Glu Gin val Arg 
405 ~ 410 415 



672 



acc tac egg gga egg ctg acc gga acc ttc ggc gac ctg aca acg gtc 720 
Thr Tyr Arg Gly Arg Leu Thr Gly Thr Phe Gly Asp Leu Thr Thr Val 
225 230 235 240 



768 



ttg acc ggc age ctg gaa ttg get cgc ate ate gag teg ctg cgt gac 816 
Leu Thr Gly Ser Leu Glu Leu Ala Arg lie lie Glu Ser Leu Arg Asp 
260 265 270 



864 



aag agg ttc gac tac cac etc ggt acc ctt cca ccg ggc tac gac cac 912 
Lys Arg Phe Asp Tyr His Leu Gly Thr Leu Pro Pro Gly Tyr Asp His 
290 295 300 



960 



cag gec gca ctt gcg ctg age cag ttg age aag att tec gca ttc ggg 1008 

Gin Ala Ala Leu Ala Leu Ser Gin Leu ser Lys lie Ser Ala Phe Gly 

325 330 335 

teg gca cgc cgc cgt aac tgg cga egg ttg cgc gaa ggg ctg tec ggg 1056 

Ser Ala Arg Arg Arg Asn Trp Arg Arg Leu Arg Glu Gly Leu Ser Gly 
340 345 350 

ttg ccg ggc ctg ctg ctg ccg gta gec aca ccg cac age gac ccg age 1104 

Leu Pro Gly Leu Leu Leu Pro val Ala Thr Pro His Ser Asp Pro Ser 

355 360 365 

tgg ttc ggg ttt gcg ate acc ate agt gcg gac gec ggg ttc acc cgt 1152 

Trp Phe Gly Phe Ala lie Thr lie Ser Ala Asp Ala Gly Phe Thr Arg 
370 375 380 

gec gec ctg gtg aac ttc ctg gaa tec cgc aac ate ggc acc cga ctg 1200 

Ala Ala Leu val Asn Phe Leu Glu Ser Arg Asn lie Gly Thr Arg Leu 
385 390 395 400 



1248 



tac egg ate gee gac gcg etc acc aac age gac ate gtc acc gac cga 1296 

Tyr Arg lie Ala Asp Ala Leu Thr Asn Ser Asp lie Val Thr Asp Arg 
420 425 430 

acc ttc tgg gtc ggc gtc tac cca ggc ata acg gac caa atg ate gac 1344 

Thr Phe Trp val Gly Val Tyr Pro Gly lie Thr Asp Gin Met lie Asp 

435 440 445 
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tac gtc gtc gaa tea ate get gaa ttc gtq gee aag agt tec tag 1389 
Tyr val val Glu Ser lie Ala Glu Phe Val Ala Lys Ser Ser 
450 455 460 

<210> 32 
<211> 462 
<212> PRT 

<213> saccharopolyspora spinosa 
<400> 32 

Met Gin Ser Arg Lys Thr Arg Ala Leu Gly Lys Gly Arg Ala Arg Val 
1 5 10 " 15 

Thr ser Cys Asp Asp Thr Cys Ala Thr Ala Thr Glu Met val Pro Asp 
20 25 30 

Ala Lys Asp Arg lie Leu Ala Ser val Arg Asp Tyr His Arg Glu Gin 
35 40 45 

Glu Ser Pro Thr Phe val Ala Gly Ser Thr Pro lie Arg Pro Ser Gly 
50 55 60 

Ala val Leu Asp Glu Asp Asp Arg val Ala Leu Val Glu Ala Ala Leu 
65 70 75 80 

Glu Leu Arg lie Ala Ala Gly Gly Asn Ala Arg Arg Phe Glu Ser Glu 
85 90 ~ 95 

Phe Ala Arg Phe Phe Gly Leu Arg Lys Ala His Leu val Asn Ser Gly 
100 105 110 

Ser ser Ala Asn Leu Leu Ala Leu Ser Ser Leu Thr Ser Pro Lys Leu 
115 120 125 

Gly Glu Ala Arg Leu Arg Pro Gly Asp Glu val lie Thr Ala Ala Val 
130 135 140 

Gly Phe Pro Thr Thr lie Asn Pro Ala Val Gin Asn Gly Leu Val Pro 
145 150 155 160 

val Phe val Asp val Glu Leu Gly Thr Tyr Asn Ala Thr Pro Asp Arg 
165 170 175 

lie Lys Ala Ala Val Thr Glu Arg Thr Arg Ala lie Met Leu Ala His 
180 185 190 

Thr Leu Gly Asn Pro Phe Ala Ala Asp Glu lie Ala Glu lie Ala Lys 
195 200 205 

Glu His Glu Leu Phe Leu Val Glu Asp Asn Cys Asp Ala Val Gly Ser 
210 215 220 

Thr Tyr Arg Gly Arg Leu Thr Gly Thr Phe Gly Asp Leu Thr Thr val 
225 230 235 240 

Ser Phe Tyr Pro Ala His His lie Thr Ser Gly Glu Gly Gly Cys Val 
245 250 255 

Leu Thr Gly Ser Leu Glu Leu Ala Arg lie He Glu Ser Leu Arg Asp 
260 265 270 

Trp Gly Arg Asp Cys Trp Cys Glu Pro Gly Val Asp Asn Thr Cys Arg 
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275 280 285 

Lys Arg Phe Asp Tyr His Leu Gly Thr Leu Pro Pro Gly Tyr Asp His 
290 295 300 

Lys Tyr Thr Phe Ser His val Gly Tyr Asn Leu Lys Thr Thr Asp Leu 
305 310 315 320 

Gin Ala Ala Leu Ala Leu Ser Gin Leu ser Lys lie Ser Ala Phe Gly 
325 330 335 

Ser Ala Arg Arg Arg Asn Trp Arg Arg Leu Arg Glu Gly Leu Ser Gly 
340 345 ~ 350 

Leu Pro Gly Leu Leu Leu Pro Val Ala Thr Pro His Ser Asp Pro Ser 
355 360 365 

Trp Phe Gly Phe Ala lie Thr lie Ser Ala Asp Ala Gly Phe Thr Arg 
370 375 380 

Ala Ala Leu val Asn Phe Leu Glu Ser Arg Asn lie Gly Thr Arg Leu 
385 390 395 400 

Leu Phe Gly Gly Asn lie Thr Arg His Pro Ala Phe Glu Gin val Arg 
405 410 415 

Tyr Arg lie Ala Asp Ala Leu Thr Asn Ser Asp lie Val Thr Asp Arg 
420 425 430 

Thr Phe Trp Val Gly val Tyr Pro Gly lie Thr Asp Gin Met lie Asp 
435 440 445 

Tyr Val val Glu Ser lie Ala Glu Phe val Ala Lys Ser Ser 
450 455 460 



<210> 33 
<211> 1158 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(1155) 

<223> ORF14; 4-ami notransf erase 

<400> 33 

gtq ate aac ctg cac cag ccg ate etc gqc acc gaa gaa etc gac gcg 48 
val lie Asn Leu His Gin Pro lie Leu Gly Thr Glu Glu Leu Asp Ala 
15 10 15 

ate gcg gag gtq ttc gec tec aac tgg ate gqg etc gqg ccg cgc acc 96 
lie Ala Glu val Phe Ala Ser Asn Trp lie Gly Leu Gly Pro Arg Thr 
20 25 30 

egg acg ttc gag gee gaa ttc gee cac cac ctg gqa gtq gat ccc gaa 144 
Arg Thr Phe Glu Ala Glu Phe Ala His His Leu Gly Val Asp Pro Glu 
35 40 45 

cag gtc gtq ttc etc aac teg gqg act gee gcg ctg ttc ctt acc gtq 192 
Gin val val Phe Leu Asn Ser Gly Thr Ala Ala Leu Phe Leu Thr Val 
50 55 60 
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cag gtq etc gac etc gqc cca gqc gac gac gtq gta ctt cct teg ata 240 
Gin Val Leu Asp Leu Gly Pro Gly Asp Asp Val val 



Asp Leu Gly Pro Gly Asp Asp Val val Leu Pro Ser lie 

65 70 75 80 

age ttc gtq gcg gcg gee aac gec ate gca tec tec ggt gee cgc ccg 

Ser Phe val Ala Ala Ala Asn Ala lie Ala Ser ser Gly Ala Arg Pro 

85 90 95 



tec gtq cac gqc ace get tgc gqa ace ttt ggt gac ctg gee acg tgg 
Ser Val His Gly Thr Ala Cys Gly Thr Phe Gly Asp Leu Ala Thr Trp 
165 170 175 



288 



gtq ttc tgc gac gtc gac ccc egg acg ttg aac ccc acg ctg gat gat 336 
val Phe Cys Asp Val Asp Pro Arg Thr Leu Asn Pro Thr Leu Asp Asp 
100 105 110 

gtq gcg agg gee ate acg ccg gcg ace aag gee gta ttg ctg etc cac 384 
val Ala Arg Ala lie Thr Pro Ala Thr Lys Ala Val Leu Leu Leu His 
115 120 125 

tat gqa gqa teg ccg gqa gaa gtc ace gcg ate gee gat ttc tgc cgt 432 
Tyr Gly Gly Ser Pro Gly Glu val Thr Ala lie Ala Asp Phe Cys Arg 
130 135 140 

gaa aag gqc etc atg etc ate gag gac tec gee tgc gcg gtq gca teg 480 
Glu Lys Gly Leu Met Leu lie Glu Asp Ser Ala Cys Ala Val Ala Ser 
145 150 155 160 



528 



agt ttc gat gcg atg aag ate ctg gtc acc ggg gat ggg gqc atg ttc 576 

Ser Phe Asp Ala Met Lys lie Leu val Thr Gly Asp Gly Gly Met Phe 
180 185 190 

tac gcg gcg gat ccg gag ctg gcg cac cgc gca aga cga etc gee tac 624 

Tyr Ala Ala Asp Pro Glu Leu Ala His Arg Ala Arg Arg Leu Ala Tyr 
195 200 205 

cac ggt ctt gag cag atg age gqa ttc gat teg gee aag tct tec aac 672 

His Gly Leu Glu Gin Met Ser Gly Phe Asp Ser Ala Lys Ser Ser Asn 
210 215 220 

cgc tgg tgg gat att cgc gtc gaa gac ate gqc cag egg ctg ate gqg 720 

Arg Trp Trp Asp lie Arg val Glu Asp lie Gly Gin Arg Leu lie Gly 

225 230 235 ~ 240 

aac gac atg acg gca gcg ctt gqc age gtq cag ctg cgc aaa ctg cca 768 

Asn Asp Met Thr Ala Ala Leu Gly Ser val Gin Leu Arg Lys Leu Pro 
245 250 255 

gaa ttc ate aac agg cgt aga gaa ate get acg cag tac gac egg ttg 816 

Glu Phe lie Asn Arg Arg Arg Glu lie Ala Thr Gin Tyr Asp Arg Leu 
260 ~ 265 270 

ctt tec gat gtq ccg ggt gtc etc eta ccg ccg acg eta ccg gat gqg 864 

Leu Ser Asp val Pro Gly Val Leu Leu Pro Pro Thr Leu Pro Asp Gly 
275 280 285 

cac gtc teg tea cac tac ttc tac tgg gtc cag ctg get ccg gag ate 912 

His val Ser Ser His Tyr Phe Tyr Trp val Gin Leu Ala Pro Glu lie 
290 295 300 

cgc gac cag gtq gcg cag caa atg ctg gaa cgc ggc ate tac acg age 960 

Arg Asp Gin Val Ala Gin Gin Met Leu Glu Arg Gly lie Tyr Thr Ser 
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305 310 315 320 

tac cgc tac ccg ccc ctg cac aag gtc ccc ate tac cgc gcg gac tgc 1008 
Tyr Arg Tyr Pro Pro Leu His Lys Val Pro lie Tyr Arg Ala Asp Cys 
325 330 335 

aag ctg cct tct gcg gag cac gec tgc cgc aga aca etc ctg eta cca 1056 
Lys Leu Pro Ser Ala Glu His Ala Cys Arg Arg Thr Leu Leu Leu Pro 
340 345 " 350 

ctg cac cca age ctt gac gac gec gag gtq cgc acg gtq get gac gag 1104 
Leu His Pro Ser Leu Asp Asp Ala Glu Val Arg Thr Val Ala Asp Glu 
355 360 ~ 365 

ttc cag aag gec gtc gaa cac cac ate age caa aga tea cca etc cga 1152 
Phe Gin Lys Ala val Glu His His He Ser Gin Arg ser Pro Leu Arg 
370 375 380 

aag tga 1158 

Lys 

385 

<210> 34 
<211> 385 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 34 

Val lie Asn Leu His Gin Pro lie Leu Gly Thr Glu Glu Leu Asp Ala 
15 10 15 

lie Ala Glu val Phe Ala Ser Asn Trp lie Gly Leu Gly Pro Arg Thr 
20 25 30 

Arg Thr Phe Glu Ala Glu Phe Ala His His Leu Gly Val Asp Pro Glu 
35 40 45 

Gin Val Val Phe Leu Asn Ser Gly Thr Ala Ala Leu Phe Leu Thr Val 
50 55 60 

Gin val Leu Asp Leu Gly Pro Gly Asp Asp val val Leu Pro Ser lie 
65 70 75 80 

ser Phe val Ala Ala Ala Asn Ala lie Ala Ser ser Gly Ala Arg Pro 
85 90 95 

Val Phe Cys Asp Val Asp Pro Arg Thr Leu Asn Pro Thr Leu Asp Asp 
100 105 110 

val Ala Arg Ala lie Thr Pro Ala Thr Lys Ala val Leu Leu Leu His 
115 120 125 

Tyr Gly Gly Ser Pro Gly Glu val Thr Ala lie Ala Asp Phe Cys Arg 
130 135 140 

Glu Lys Gly Leu Met Leu lie Glu Asp Ser Ala Cys Ala Val Ala Ser 
145 150 155 160 

Ser val His Gly Thr Ala Cys Gly Thr Phe Gly Asp Leu Ala Thr Trp 
165 170 175 

Ser Phe Asp Ala Met Lys lie Leu val Thr Gly Asp Gly Gly Met Phe 
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Tyr Ala Ala Asp Pro Glu Leu Ala His Arg Ala Arg Arg Leu Ala Tyr 
195 200 205 

His Gly Leu Glu Gin Met Ser Gly Phe Asp Ser Ala Lys Ser Ser Asn 
210 215 220 

Arg Trp Trp Asp lie Arg val Glu Asp lie Gly Gin Arg Leu lie Gly 
225 230 235 240 

Asn Asp Met Thr Ala Ala Leu Gly Ser val Gin Leu Arg Lys Leu Pro 
245 250 255 

Glu Phe lie Asn Arg Arg Arg Glu lie Ala Thr Gin Tyr Asp Arg Leu 
260 " ~ 265 270 

Leu Ser Asp val Pro Gly val Leu Leu Pro Pro Thr Leu Pro Asp Gly 
275 280 285 

His Val Ser ser His Tyr Phe Tyr Trp Val Gin Leu Ala Pro Glu lie 
290 295 300 

Arg Asp Gin val Ala Gin Gin Met Leu Glu Arg Gly lie Tyr Thr Ser 
305 310 315 320 

Tyr Arg Tyr Pro Pro Leu His Lys val Pro lie Tyr Arg Ala Asp Cys 
325 330 ~ 335 

Lys Leu Pro Ser Ala Glu His Ala Cys Arg Arg Thr Leu Leu Leu Pro 
340 345 ~ 350 

Leu His Pro Ser Leu Asp Asp Ala Glu val Arg Thr Val Ala Asp Glu 
355 360 365 

Phe Gin Lys Ala Val Glu His His lie Ser Gin Arg Ser Pro Leu Arg 
370 375 380 

Lys 
385 



<210> 35 
<211> 750 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(747) 

<223> ORF15; N-dimethyl transferase 
<400> 35 

atg teg cgc gtq age gac aca ttc gca gaa acc tec teg gta tac age 48 
Met Ser Arg val Ser Asp Thr Phe Ala Glu Thr ser Ser val Tyr Ser 
1 ~ 5 10 15 

cca gat cat gec gac ate tac gac gcg ate cac tec gcg cgt gqc egg 96 
Pro Asp His Ala Asp lie Tyr Asp Ala lie His Ser Ala Arg Gly Arg 
20 25 30 

gac tgg gca gee gag gee ggg gaa gta gtc cag etc gta cgc acc agg 144 
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Asp Trp Ala Ala Glu Ala Gly Glu val val Gin Leu val Arg Thr Arg 
35 40 45 

ctg ccc gaa gca cag tec eta etc gac gtc gec tgt gqg acc gqg gcg 192 
Leu Pro Glu Ala Gin Ser Leu Leu Asp Val Ala Cys Gly Thr Gly Ala 
50 55 60 

cac eta gag cga ttc cgt gee gaa tac gcg aag gtc gcg gqg ctt gaa 240 
His Leu Glu Arg Phe Arg Ala Glu Tyr Ala Lys Val Ala Gly Leu Glu 
65 70 75 80 

ctg tec gat gcg atg egg gag ate gcg ate aga cga gtc cct gag gta 288 
Leu ser Asp Ala Met Arg Glu He Ala lie Arg Arg val Pro Glu Val 
85 90 95 

ccg att cac ate gqt gac ate cgc gat ttc gac etc gqc gag cca ttc 336 
Pro lie His lie Gly Asp lie Arg Asp Phe Asp Leu Gly Glu Pro Phe 
100 105 110 

gac gtc ate acc tgc ctg tgc ttt acc gcg get tac atg egg acc gtt 384 
Asp val lie Thr Cys Leu Cys Phe Thr Ala Ala Tyr Met Arg Thr val 
115 120 125 

gac gac ctg cga cgc gtq acg egg aac atg gee egg cac ctg gec cct 432 
Asp Asp Leu Arg Arg Val Thr Arg Asn Met Ala Arg His Leu Ala Pro 
130 ~ 135 140 

gqt gqa gtc gcg gtc ate gaa ccc tgg tgg ttt ccc gac aag ttc ate 480 
Gly Gly Val Ala Val lie Glu Pro Trp Trp Phe Pro Asp Lys Phe lie 
145 150 155 160 

gac gqg ttc gtc acc gqa gec gtc gcg cac cac gqc gag egg gtq ate 528 
Asp Gly Phe Val Thr Gly Ala Val Ala His His Gly Glu Arg val lie 
165 170 175 

age egg eta teg cac teg gtc ctg gag gqc cgt acg age egg atg acc 576 
ser Arg Leu Ser His Ser val Leu Glu Gly Arg Thr Ser Arg Met Thr 
180 185 190 

gtc cgc tac aca gtc gee gaa ccc acc gqg ate egg gac ttc aca gag 624 
val Arg Tyr Thr val Ala Glu Pro Thr Gly lie Arg Asp Phe Thr Glu 
195 200 205 

ttc gaa ate etc teg ctg ttc act gag gac gag tac acc gee gcg etc 672 
Phe Glu lie Leu Ser Leu Phe Thr Glu Asp Glu Tyr Thr Ala Ala Leu 
210 215 220 

gaa gac gca gqg ate cgc gcg gaa tac ctt cct gqa gca ccg aac gqc 720 
Glu Asp Ala Gly lie Arg Ala Glu Tyr Leu Pro Gly Ala Pro Asn Gly 
225 230 235 240 

cga gqc ctg ttc gtc gqa ate cgc aac tga 750 
Arg Gly Leu Phe val Gly lie Arg Asn 
245 

<210> 36 
<211> 249 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 36 

Met Ser Arg val Ser Asp Thr Phe Ala Glu Thr ser Ser Val Tyr Ser 
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10 15 

He Tyr Asp Ala lie His Ser Ala Arg Gly Arg 
25 30 

Ala Gly Glu val val Gin Leu val Arg Thr Arg 
40 45 

Ser Leu Leu Asp val Ala Cys Gly Thr Gly Ala 
55 60 

Arg Ala Glu Tyr Ala Lys Val Ala Gly Leu Glu 
70 ' 75 80 

Arg Glu lie Ala lie Arg Arg val Pro Glu val 
90 95 

Asp lie Arg Asp Phe Asp Leu Gly Glu Pro Phe 
105 110 

Leu Cys Phe Thr Ala Ala Tyr Met Arg Thr val 
120 125 

val Thr Arg Asn Met Ala Arg His Leu Ala Pro 
135 140 

lie Glu Pro Trp Trp Phe Pro Asp Lys Phe lie 
150 155 160 

Gly Ala val Ala His His Gly Glu Arg val lie 
170 175 

Ser Val Leu Glu Gly Arg Thr Ser Arg Met Thr 
185 190 

Ala Glu Pro Thr Gly lie Arg Asp Phe Thr Glu 
200 205 

Leu Phe Thr Glu Asp Glu Tyr Thr Ala Ala Leu 
215 220 

Arg Ala Glu Tyr Leu Pro Gly Ala Pro Asn Gly 
230 235 240 

Gly lie Arg Asn 



1 5 

Pro Asp His Ala Asp 
20 

Asp Trp Ala Ala Glu 
35 

Leu Pro Glu Ala Gin 
50 

His Leu Glu Arg Phe 
65 

Leu Ser Asp Ala Met 
85 

Pro lie His lie Gly 
100 

Asp val lie Thr cys 
115 

Asp Asp Leu Arg Arg 
130 

Gly Gly val Ala val 
145 

Asp Gly Phe val Thr 
165 

Ser Arg Leu Ser His 
180 

Val Arg Tyr Thr Val 
195 

Phe Glu lie Leu Ser 
210 

Glu Asp Ala Gly lie 
225 

Arg Gly Leu Phe val 
245 



<210> 37 
<211> 726 
<212> DNA 
<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(723) 

<223> ORF16; 3, 4- reductase 

<400> 37 

atg age gaa cag acg att gca ctg 
Met Ser Glu Gin Thr lie Ala Leu 
1 5 



gtc acc ggc gca aac aag gga ate 
val Thr Gly Ala Asn Lys Gly lie 
10 15 
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gqa tac gag ate gcg gec gqg etc gqc gcg ctg gqg tgg age gtc gqa 96 

Gly Tyr Glu lie Ala Ala Gly Leu Gly Ala Leu Gly Trp Ser Val Gly 

20 25 30 

ate gqg gca egg gac cac cag cgc gqg gag gat gec gtq gcg aaa ttg 144 

lie Gly Ala Arg Asp His Gin Arg Gly Glu Asp Ala Val Ala Lys Leu 
35 40 45 

cgt gcg gac gqc gtc gat gcg ttc gcg gta tec ctg gac gtq aca gac 192 

Arg Ala Asp Gly Val Asp Ala Phe Ala val ser Leu Asp vat Thr Asp 
50 55 60 

gac gcg age gtc gcg get get gcg get ctg etc gag gag cgc gee gqc 240 

Asp Ala ser val Ala Ala Ala Ala Ala Leu Leu Glu Glu Arg Ala Gly 

65 70 75 ~ 80 

egg etc gat gtq ctg gtt aat aac gec gqc ate gee gqg gca tgg ccg 288 

Arg Leu Asp Val Leu val Asn Asn Ala Gly lie Ala Gly Ala Trp Pro 

85 90 95 

gag gag ccc teg ace gtc aca ccg gcg age etc egg gcg gtq gtq gag 336 

Glu Glu Pro Ser Thr Val Thr Pro Ala Ser Leu Arg Ala Val Val Glu 

100 105 110 

ace aac gtq ate gqc gtc gtt egg gtt acc aac get atg ctg ccg ttg 384 

Thr Asn val lie Gly val val Arg val Thr Asn Ala Met Leu Pro Leu 
115 120 125 

eta cgc cgc tec gag cgc ccg egg ate gtc aac cag tec age cac gtc 432 

Leu Arg Arg Ser Glu Arg Pro Arg lie Val Asn Gin Ser Ser His val 
130 135 140 

get tec ctg acc ttg caa acc acg ccg gqc gtc gac etc gqc gqg ate 480 

Ala Ser Leu Thr Leu Gin Thr Thr Pro Gly Val Asp Leu Gly Gly lie 

145 150 155 160 

age gqa gee tac tea ccg teg aag acg ttc etc aac gcg ate acc ate 528 

Ser Gly Ala Tyr Ser Pro Ser Lys Thr Phe Leu Asn Ala lie Thr lie 

165 170 175 

cag tac gee aag gaa etc age gat acc aac ate aaa ate aac aac gec 576 

Gin Tyr Ala Lys Glu Leu Ser Asp Thr Asn lie Lys lie Asn Asn Ala 

180 185 190 



tgc ccc gqc tac gtc gcg acc gac ctt aac gqc ttc cac gqa acc age 624 
Cys Pro Gly Tyr val Ala Thr Asp Leu Asn Gly Phe His Gly Thr Ser 
195 200 205 



acg ccg gca gac gqt gec agg ate gec att egg etc gec acg ctg cca 672 
Thr Pro Ala Asp Gly Ala Arg lie Ala lie Arg Leu Ala Thr Leu Pro 
210 215 220 

gac gac gqc ccg acc gqa gqc atg ttc gac gac gec gqg aat gtq ccc 720 
Asp Asp Gly Pro Thr Gly Gly Met Phe Asp Asp Ala Gly Asn Val Pro 
225 230 235 240 

tgg tga 726 
Trp 

<210> 38 
<211> 241 
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<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 38 

Met Ser Glu Gin Thr lie Ala Leu Val Thr Gly Ala Asn Lys Gly lie 
1 5 10 15 

Gly Tyr Glu lie Ala Ala Gly Leu Gly Ala Leu Gly Trp Ser Val Gly 
20 25 30 

lie Gly Ala Arg Asp His Gin Arg Gly Glu Asp Ala val Ala Lys Leu 
35 40 45 

Arg Ala Asp Gly Val Asp Ala Phe Ala Val Ser Leu Asp val Thr Asp 
50 55 60 

Asp Ala ser Val Ala Ala Ala Ala Ala Leu Leu Glu Glu Arg Ala Gly 
65 70 75 80 

Arg Leu Asp val Leu val Asn Asn Ala Gly lie Ala Gly Ala Trp Pro 
85 90 95 

Glu Glu Pro Ser Thr Val Thr Pro Ala Ser Leu Arg Ala val Val Glu 
100 105 110 

Thr Asn val lie Gly val val Arg val Thr Asn Ala Met Leu Pro Leu 
115 120 125 

Leu Arg Arg ser Glu Arg Pro Arg lie val Asn Gin Ser ser His val 
130 135 ~ 140 

Ala Ser Leu Thr Leu Gin Thr Thr Pro Gly Val Asp Leu Gly Gly lie 
145 150 155 ' 160 

Ser Gly Ala Tyr Ser Pro Ser Lys Thr Phe Leu Asn Ala lie Thr lie 
165 170 175 

Gin Tyr Ala Lys Glu Leu Ser Asp Thr Asn lie Lys lie Asn Asn Ala 
180 185 190 

Cys Pro Gly Tyr val Ala Thr Asp Leu Asn Gly Phe His Gly Thr Ser 
195 200 205 

Thr Pro Ala Asp Gly Ala Arg lie Ala lie Arg Leu Ala Thr Leu Pro 
210 215 220 

Asp Asp Gly Pro Thr Gly Gly Met Phe Asp Asp Ala Gly Asn Val Pro 
225 230 235 240 

Trp 



<210> 39 
<211> 837 
<212> DNA 

<213> saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(834) 

<223> ORF17; transcription regulator 
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<400> 39 

atg gag acg egg gag ttg egg tac ttc gtt gca gtc gec gag gag ttg 48 
Met Glu Thr Arg Glu Leu Arg Tyr Phe Val Ala Val Ala Glu Glu Leu 
1 5 10 15 

cac ttc gqc egg gec gec cag cgc ctg gqc ate gec cag ccg ccg ctg 96 
His Phe Gly Arg Ala Ala Gin Arg Leu Gly lie Ala Gin Pro Pro Leu 
20 25 30 

teg egg acg ate gee cag etc gag caa cga etc gqa gtc gtq ttg ctg 
ser Arg Thr lie Ala Gin Leu Glu Gin Arg Leu Gly Val val Leu Leu 
35 40 ~ 45 



144 



caa cgc ace age cgc aaa gtc teg etc acc gaa gec gqg gca atg ctg 192 

Gin Arg Thr Ser Arg Lys Val Ser Leu Thr Glu Ala Gly Ala Met Leu 
50 55 60 

ctg acc gaa gqc egg gcg ate etc gqc gcg ctg gca gca gec gag cga 240 

Leu Thr Glu Gly Arg Ala lie Leu Gly Ala Leu Ala Ala Ala Glu Arg 

65 70 75 80 

cgc acc cag cgt gec gcg acg age cag ccc teg eta gtc ctg get gee 288 

Arg Thr Gin Arg Ala Ala Thr Ser Gin Pro Ser Leu Val Leu Ala Ala 

85 90 95 

aag gee gqc gee tec ggt gag ctg ctg gcg aag ttg etc gac gcg tac 336 

Lys Ala Gly Ala Ser Gly Glu Leu Leu Ala Lys Leu Leu Asp Ala Tyr 

100 105 110 

gee gee gag ccg gqa gee gtq gee gtc gac ctg ctg etc tgc gaa tec 384 

Ala Ala Glu Pro Gly Ala val Ala Val Asp Leu Leu Leu Cys Glu Ser 

115 120 125 

cag ccc cag aaa acg ctg cat gac gqc egg gec gac gtq gcg ctg ttg 432 

Gin Pro Gin Lys Thr Leu His Asp Gly Arg Ala Asp Val Ala Leu Leu 
130 135 140 

cat caa ccc ttc gac ccg acg gec gaa etc gac ate gaa att ctg aac 480 

His Gin Pro Phe Asp Pro Thr Ala Glu Leu Asp lie Glu lie Leu Asn 

145 150 155 160 

acc gag caa caa gtc gec att ctt ccg acc teg cat ccg ctt gec age 528 

Thr Glu Gin Gin Val Ala lie Leu Pro Thr Ser His Pro Leu Ala ser 

165 170 175 

gag ccc cat gta egg atg gcg gat gtc age tea ctg ccg gat etc ccg 576 

Glu Pro His val Arg Met Ala Asp val Ser Ser Leu Pro Asp Leu Pro 

180 ~ 185 190 

ctt gcg cgc tgg ccc gqc ccc gac gqc gtc tat cca gat gqc ccc gqc 624 

Leu Ala Arg Trp Pro Gly Pro Asp Gly Val Tyr Pro Asp Gly Pro Gly 

195 200 205 

gtq gaa gta cgc aac cag acg caa ctg ttc caa atg ate gca etc gqc 672 

val Glu val Arg Asn Gin Thr Gin Leu Phe Gin Met lie Ala Leu Gly 
210 215 220 

cgc act acc gtq gtc atg ccc gaa tec agt cgc gtc aac ctg etc gaa 720 

Arg Thr Thr val val Met Pro Glu Ser Ser Arg Val Asn Leu Leu Glu 

225 230 235 240 

ggc etc gec gec gta ccg gtt eta gac gcg ccg gac gtg acg aca gtc 768 
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Gly Leu Ala Ala Val Pro Val Leu Asp Ala Pro Asp Val Thr Thr Val 
245 250 255 

ate gec tgg ccg ccc cac age cgc tec cga gca etc gee ggc ttg gtc 816 
lie Ala Trp Pro Pro His Ser Arg Ser Arg Ala Leu Ala Gly Leu Val 
260 265 270 

cgc gtq gee aca etc etc taa 837 
Arg val Ala Thr Leu Leu 
275 

<210> 40 
<211> 278 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 40 

Met Glu Thr Arg Glu Leu Arg Tyr Phe val Ala val Ala Glu Glu Leu 
1 5 10 15 

His Phe Gly Arg Ala Ala Gin Arg Leu Gly lie Ala Gin Pro Pro Leu 
20 25 30 

Ser Arg Thr lie Ala Gin Leu Glu Gin Arg Leu Gly Val Val Leu Leu 
35 40 ~ 45 

Gin Arg Thr Ser Arg Lys Val ser Leu Thr Glu Ala Gly Ala Met Leu 
50 55 60 

Leu Thr Glu Gly Arg Ala lie Leu Gly Ala Leu Ala Ala Ala Glu Ar 
65 70 75 8 

Arg Thr Gin Arg Ala Ala Thr Ser Gin Pro Ser Leu Val Leu Ala Ala 
85 90 95 

Lys Ala Gly Ala Ser Gly Glu Leu Leu Ala Lys Leu Leu Asp Ala Tyr 
100 105 110 

Ala Ala Glu Pro Gly Ala Val Ala Val Asp Leu Leu Leu Cys Glu Ser 
115 120 125 

Gin Pro Gin Lys Thr Leu His Asp Gly Arg Ala Asp val Ala Leu Leu 
130 135 140 

His Gin Pro Phe Asp Pro Thr Ala Glu Leu Asp lie Glu lie Leu Asn 
145 150 155 160 

Thr Glu Gin Gin val Ala lie Leu Pro Thr ser His Pro Leu Ala Ser 
165 170 175 

Glu Pro His val Arg Met Ala Asp val Ser Ser Leu Pro Asp Leu Pro 
180 185 190 

Leu Ala Arg Trp Pro Gly Pro Asp Gly val Tyr Pro Asp Gly Pro Gly 
195 200 205 

Val Glu val Arg Asn Gin Thr Gin Leu Phe Gin Met lie Ala Leu Gly 
210 215 220 

Arg Thr Thr val val Met Pro Glu Ser Ser Arg Val Asn Leu Leu Glu 
225 230 235 240 
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Gly Leu Ala Ala val Pro val Leu Asp Ala Pro Asp Val Thr Thr val 
245 250 255 

lie Ala Trp Pro Pro His Ser Arg Ser Arg Ala Leu Ala Gly Leu val 
260 265 270 

Arg Val Ala Thr Leu Leu 
275 



<210> 41 
<211> 7788 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(7785) 

<223> ORF18; polyketide synthase 

<400> 41 

atg age gaa gec gqg aac ctg ata gec gtc ate gqa ctg tec tgc cgc 48 

Met Ser Glu Ala Gly Asn Leu lie Ala val lie Gly Leu ser Cys Arg 
1 5 10 15 

eta ccc cag gcg cct gae ccg get tec ttc tgg egg ttg ctg cgc ace 96 
Leu Pro Gin Ala Pro Asp Pro Ala ser Phe Trp Arg Leu Leu Arg Thr 
20 25 30 

gqa acg gae gee ate ace acg gtc ccg gaa gqg egg tgg gqc gac ccg 144 
Gly Thr Asp Ala lie Thr Thr val Pro Glu Gly Arg Trp Gly Asp Pro 
35 40 45 



ttg cct gqt egg gat gcg ccc aag gqc ccg gaa tgg gqt gqt ttc ctg 
Leu Pro Gly Arg Asp Ala Pro Lys Gly Pro Glu Trp Gly Gly Phe Leu 
50 ~ 55 60 



act gec gee gqt gtq ttc atg gqg gcg ate tct gac gac tac gec gec 
Thr Ala Ala Gly Val Phe Met Gly Ala lie Ser Asp Asp Tyr Ala Ala 
115 120 125 



192 



get gat gtc gac tgc ttc gat ccc gag ttc ttc gqg ate teg ccg cga 240 

Ala Asp val Asp Cys Phe Asp Pro Glu Phe Phe Gly lie ser Pro Arg 

65 70 75 80 

gaa gcg gca gec gtq gat ccc cag cag agg ctg get ctg gag etc gee 288 

Glu Ala Ala Ala val Asp Pro Gin Gin Arg Leu Ala Leu Glu Leu Ala 

85 90 95 

tgg gag gca etc gaa gac gee gqt ate ccc gee gqc gag ctg cgc gqt 336 

Trp Glu Ala Leu Glu Asp Ala Gly lie Pro Ala Gly Glu Leu Arg Gly 

100 105 110 



384 



ctg ctg cgc gag age ccg ccg gaa gtq get gcg cag tac cgc etc ace 432 

Leu Leu Arg Glu Ser Pro Pro Glu Val Ala Ala Gin Tyr Arg Leu Thr 
130 135 140 

gqc ace cat cga age ctg ate gec aac cgc gtq tec tat gtq etc gqc 480 

Gly Thr His Arg Ser Leu lie Ala Asn Arg val Ser Tyr Val Leu Gly 
145 150 155 160 

ctg cgc ggg cca age ctg acg gtg gat tea ggt cag tec teg tec ctg 528 
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Leu Arg Gly Pro Ser Leu Thr Val Asp Ser Gly Gin Ser Ser Ser Leu 
165 170 175 

gtc ggc gtg cat etc gec age gag age ctg cga egg ggt gag tgc acg 
val Gly val His Leu Ala Ser Glu Ser Leu Arg Arg Gly Glu Cys Thr 
180 185 190 



ate tac tgc etc ate cgc ggc age gcg gtc aac aac gat ggg ggc ggt 
lie Tyr Cys Leu lie Arg Gly Ser Ala val Asn Asn Asp Gly Gly Gly 
260 " 265 270 



ly Leu Thr Val Pro Ala Ala Asp Ala Gin Ala Glu Leu Leu Arg 

275 280 285 

cag gca tac egg aac gcg ggc gtc gac ccg gee gee gtg cag tat gtc 

Gin Ala Tyr Arg Asn Ala Gly Val Asp Pro Ala Ala Val Gin Tyr val 
290 295 300 



gee etc gga get gtc ctg ggg gcg gcg aga egg ccc ggc gac gag ctg 
Ala Leu Gly Ala val Leu Gly Ala Ala Arg Arg Pro Gly Asp Glu Leu 
325 330 335 

cgt gtg ggg teg gcg aag ace aac gtc ggc cat ctg gaa gca gcg gcg 
Arg val Gly Ser Ala Lys Thr Asn Val Gly His Leu Glu Ala Ala Ala 
340 345 350 



age gag ggg ccg ctg ctg gee ggc gtc age gee ttc gga atg gga ggc 

Ser Glu Gly Pro Leu Leu Ala Gly val Ser Ala Phe Gly Met Gly Gly 
405 410 415 
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ate gca etc gee ggc ggc gtg aac etc aac ctg gee gee gag age aac 624 

lie Ala Leu Ala Gly Gly Val Asn Leu Asn Leu Ala Ala Glu Ser Asn 

195 200 205 

age get ctg atg gac ttc ggc gcg etc tec ccg gac ggt cgc tgc ttc 672 

Ser Ala Leu Met Asp Phe Gly Ala Leu Ser Pro Asp Gly Arg Cys Phe 

210 215 220 

acc ttc gat gtg egg gcg aac ggt tac gtc cgt ggt gag ggc ggc ggc 

Thr Phe Asp Val Arg Ala Asn Gly Tyr Val Arg Gly Glu Gly Gly Gly 

225 230 235 240 



720 



ctt gtc gtg ctg aag aag gec gat cag gcg cac gec gat ggc gac egg 768 
Leu val val Leu Lys Lys Ala Asp Gin Ala His Ala Asp Gly Asp Arg 
245 250 255 



816 



gec ggg etc acc gtt ccg gcg gcg gac gec cag gcg gag ctg ctg cgc 864 
Ala Gl\ ' " " " " " 



912 



gag etc cac ggc age gcg acc agg gtc ggg gat ccc gtc gaa gca gca 960 
Glu Leu His Gly Ser Ala Thr Arg Val Gly Asp Pro val Glu Ala Ala 
305 310 315 320 



1008 



1056 



ggc gtc acc ggg ttg ctg aag acc gca etc age ate tgg cac cgc gaa 1104 
Gly val Thr Gly Leu Leu Lys Thr Ala Leu Ser lie Trp His Arg Glu 
355 360 365 

ctg ccg ccg agt ctt cat ttc acc gec ccc aac ccg gaa ate ccg ctg 1152 
Leu Pro Pro Ser Leu His Phe Thr Ala Pro Asn Pro Glu lie Pro Leu 
370 375 380 

gac gaa ttg aac eta cgc gtc cag cgt gat ctg egg ccg tgg ccg gag 1200 
Asp Glu Leu Asn Leu Arg val Gin Arg Asp Leu Arg Pro Trp Pro Glu 
385 390 395 ' 400 



1248 
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acg aac tgc cac ctg gtq etc tec gqc acg tec egg gtq gag cga egg 1296 

Thr Asn Cys His Leu VaT Leu Ser Gly Thr Ser Arg Val Glu Arg Arg 

420 425 " 430 

cgc agt gqa ccc get gag gcg acc atg ccg tgg gtc ttg teg gec aga 1344 

Arg Ser Gly Pro Ala Glu Ala Thr Met Pro Trp Val Leu Ser Ala Arg 
435 440 445 

aca ccg gtc gca ttg cgt gcg cag gcg gcg cgc ttg cac acg cac etc 1392 

Thr Pro val Ala Leu Arg Ala Gin Ala Ala Arg Leu His Thr His Leu 
450 455 460 

aat acg gec gqt caa agt ccg ttg gac gtc gec tac tea ctg gcg acc 1440 

Asn Thr Ala Gly Gin ser Pro Leu Asp val Ala Tyr ser Leu Ala Thr 

465 470 475 480 

act cga tec gcg ctg ccg cac egg gee gcg ctg gtc gcg gac gac gaa 1488 

Thr Arg Ser Ala Leu Pro His Arg Ala Ala Leu val Ala Asp Asp Glu 

485 ~ 490 495 

ccg aaa ctg etc gec gqg ttg aag gee etc get gac gqc gac gac gcg 1536 

Pro Lys Leu Leu Ala Gly Leu Lys Ala Leu Ala Asp Gly Asp Asp Ala 

500 505 510 



ccc acg ctg tgc cac gqc gcg act tec gqc gag egg gca gcg gtc ttc 

Pro Thr Leu Cys His Gly Ala Thr Ser Gly Glu Arg Ala Ala val Phe 
515 520 525 

gtc ttt ccc gqa cag gqc age cag tgg ate gqg atg gqt agg cag ctg 

val Phe Pro Gly Gin Gly Ser Gin Trp lie Gly Met Gly Arg Gin Leu 
530 535 540 



gcg gtq cgc age agg ctt ctg acg gcg ctg gec ggc agt gqc gcg atg 
Ala val Arg Ser Arg Leu Leu Thr Ala Leu Ala Gly Ser Gly Ala Met 
645 650 655 



1584 



1632 



etc gaa acc tec gag gtt ttc gcg gcg teg atg teg gac tgc gee gac 1680 

Leu Glu Thr Ser Glu Val Phe Ala Ala Ser Met Ser Asp Cys Ala Asp 

545 550 555 560 

gca ttg gcg ccg cac ctg gat tgg tec ctg ctg gat gtq ctg cgc aac 1728 

Ala Leu Ala Pro His Leu Asp Trp Ser Leu Leu Asp val Leu Arg Asn 

565 570 575 

gcg gee gqc get gcg cac ctt gac cac gac gat gtc gtc cag ccc gcg 1776 

Ala Ala Gly Ala Ala His Leu Asp His Asp Asp val val Gin Pro Ala 
580 585 590 

ctg ttc gec ate atg gtc teg etc gcg gag etc tgg cgt teg tgg gqc 1824 

Leu Phe Ala lie Met Val Ser Leu Ala Glu Leu Trp Arg Ser Trp Gly 
595 600 605 

gtq cgt ccg gtq gcg gtc gtc gqg cac teg cag gqg gag ate gcg gcg 

val Arg Pro Val Ala val Val Gly His Ser Gin Gly Glu lie Ala Ala 
610 615 620 



1872 



gee tgc gtc gec gqg gee ctg tec gtc cgc gat gee gee agg gtq gtq 1920 
Ala Cys val Ala Gly Ala Leu Ser Val Arg Asp Ala Ala Arg Val Val 
625 630 635 ~ 640 



1968 



gee teg ttg cag cat ccc gec gaa gag gtq egg caa ate ctg ttg ccc 2016 
Ala ser Leu Gin His Pro Ala Glu Glu val Arg Gin lie Leu Leu Pro 
660 665 670 
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tgg cgc gat egg ate ggc gtg gcg ggg gtg aac gqa ccg teg teg ace 

Trp Arg Asp Arg lie Gly val Ala Gly val Asn Gly Pro Ser ser Thr 
675 680 685 

ctg gtg tea gqg gac egg gag gcg atg gcg gaa ctg ctg gee gag tgc 

Leu val ser Gly Asp Arg Glu Ala Met Ala Glu Leu Leu Ala Glu Cys 
690 695 700 



2064 



2112 



gca gac cga gag etc egg atg cgc egg att ccc gtt gaa tac gec tec 2160 

Ala Asp Arg Glu Leu Arg Met Arg Arg lie Pro Val Glu Tyr Ala Ser 

705 710 715 720 

cat teg cct cac ate gag gtt gtc egg gat gag ctg ctg gqg ctg ttg 2208 

His ser Pro His lie Glu val val Arg Asp Glu Leu Leu Gly Leu Leu 

725 730 735 

gcg ccg gtc gaa ccc agg acg gqa age ate ccg ate tat teg acg acg 2256 

Ala Pro Val Glu Pro Arg Thr Gly Ser He Pro lie Tyr Ser Thr Thr 

740 745 750 

acc gqg gac ctg ctg gac egg ccg atg gac gee gac tac tgg tac cgc 2304 

Thr Gly Asp Leu Leu Asp Arg Pro Met Asp Ala Asp Tyr Trp Tyr Arg 

755 760 765 

aac ctt cgt caa ccg gtg ctg ttc gaa gcg gec gtc gag gec ctg ttg 2352 

Asn Leu Arg Gin Pro val Leu Phe Glu Ala Ala val Glu Ala Leu Leu 
770 775 780 

aag egg gqg tac gac gca ttc ate gag ate age cca cac ccg gtg ctg 2400 

Lys Arg Gly Tyr Asp Ala Phe lie Glu lie Ser Pro His Pro val Leu 

785 790 795 800 

act gcg aac ate cag gaa acc gee gtg cga gca ggg egg gag gta gtg 

Thr Ala Asn lie Gin Glu Thr Ala Val Arg Ala Gly Arg Glu Val Val 

805 810 815 

gcg etc ggg aca etc cgc cgc ggc gaa ggt ggc atg egg cag gcg ctg 

Ala Leu Gly Thr Leu Arg Arg Gly Glu Gly Gly Met Arg Gin Ala Leu 

820 ~ 825 830 



2448 



2496 



acg teg ctg gee aga gca cac gta cac gga gtg gec gcg gac tgg cac 2544 

Thr Ser Leu Ala Arg Ala His Val His Gly Val Ala Ala Asp Trp His 

835 ~ 840 845 

gcg gtc ttc gec ggt acc ggg gcg cag egg gtc gac ctg ccg acg tac 2592 

Ala Val Phe Ala Gly Thr Gly Ala Gin Arg Val Asp Leu Pro Thr Tyr 
850 855 ~ 860 

gec ttt cag cga cag cgc tac tgg ctg gac gcg aag ctt ccc gac gtc 2640 

Ala Phe Gin Arg Gin Arg Tyr Trp Leu Asp Ala Lys Leu Pro Asp val 
865 870 875 880 

gec atg ccc gag age gac gtg teg acg gcg ttg egg gaa aag ctg egg 2688 

Ala Met Pro Glu ser Asp val Ser Thr Ala Leu Arg Glu Lys Leu Arg 
885 890 895 

tct teg ccg agg gcg gac gtg gac teg acg acc etc acg atg ate egg 2736 

Ser ser Pro Arg Ala Asp Val Asp Ser Thr Thr Leu Thr Met lie Arg 

900 905 910 

gca cag gca gec gtg gtc etc ggc cac tec gat ccg aaa gag gtg gac 2784 

Ala Gin Ala Ala Val Val Leu Gly His Ser Asp Pro Lys Glu val Asp 
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915 920 925 

ccg gat egg acg ttc aag gac ctg ggc ttc gat tec teg atg gtg gtc 2832 
Pro Asp Arg Thr Phe Lys Asp Leu Gly Phe Asp Ser Ser Met Val val 
930 ~ 935 940 

gag ctg tgc gac cgc eta aac gee gee aca ggt ctg cga etc gca ccg 2880 
Glu Leu cys Asp Arg Leu Asn Ala Ala Thr Gly Leu Arg Leu Ala Pro 
945 950 955 960 

age gtc gtt ttc gac tgt cct acg ccg gac aag etc gee cgc cag gta 2928 
ser val val Phe Asp cys Pro Thr Pro Asp Lys Leu Ala Arg Gin val 
965 970 975 

egg acg ttg ttg ttg ggc gag ccg get ccc atg acg tea cac egg ccg 2976 
Arg Thr Leu Leu Leu Gly Glu Pro Ala Pro Met Thr Ser His Arg Pro 
980 985 990 

gac tec gat gcg gac gag cct ate gee gtq ate ggg atg ggc tgt egg 3024 
Asp Ser Asp Ala Asp Glu Pro lie Ala Val lie Gly Met Gly Cys Arg 
995 1000 1005 



ttt ccg gqt ggg gtg tec teg ccc gag gag ttg tgg cag ttg gtc gee 
Phe Pro Gly Gly Val Ser Ser Pro Glu Glu Leu Trp Gin Leu Val Ala 
1010 1015 1020 



ly Arg Asp val val Ser Glu Phe Pro Ala Asp Arg Gly Trp Asp 

1025 1030 1035 ~ 1040 

ctg gag cgt gcg ggg aca teg cac gtg cgc gee ggc ggg ttc ttg cat 

Leu Glu Arg Ala Gly Thr Ser His val Arg Ala Gly Gly Phe Leu His 

1045 1050 1055 



gaa gca gtc gaa cga ggc ggg ate aac ccg cag cac ctg cac gga agt 

Glu Ala val Glu Arg Gly Gly lie Asn Pro Gin His Leu His Gly Ser 
1090 1095 1100 

caa acc ggg gtc ttc gtc ggc gcg ace tec ctg gac tac ggg cca cgc 

Gin Thr Gly Val Phe Val Gly Ala Thr Ser Leu Asp Tyr Gly Pro Arg 

1105 1110 1115 1120 

ctg cac gaa gcg tec gag gag gcg gee ggg tac gtg etc acc ggc age 

Leu His Glu Ala Ser Glu Glu Ala Ala Gly Tyr val Leu Thr Gly ser 

1125 1130 1135 



ggc cct gcg gtg acg gtg gat acg gcg tgt teg teg teg ttg gtg gee 
Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala 
1155 1160 1165 



3072 



get ggg egg gac gtc gtg tec gag ttc ccg get gac cga ggt tgg gac 3120 
Ala Gly Arg Asp Val Val Ser Glu Phe Pro Ala Asp Arg Gl\ 



3168 



ggc gee ccg gat ttt gac ccc ggg ttc ttc egg att teg ccg cgc gag 3216 
Gly Ala Pro Asp Phe Asp Pro Gly Phe Phe Arg lie Ser Pro Arg Glu 
1060 1065 ~ 1070 

gcg ttg gcg atg gat cca cag cag egg ttg ctg ctg gaa ate gee tgg 3264 
Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu lie Ala Trp 
1075 1080 ~ 1085 



3312 



3360 



3408 



acc acg agt gtg gcg teg ggt egg gtt gcg tat teg ttc ggg ttc gag 3456 
Thr Thr Ser val Ala Ser Gly Arg Val Ala Tyr Ser Phe Giy Phe Glu 
1140 1145 1150 



3504 



ctg cat ttg gcg tgt cag teg ttg cgt teg ggt gag tgt gat ctg gcg 3552 
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Leu His Leu Ala Cys Gin Ser Leu Arg Ser Gly Glu Cys Asp Leu Ala 
1170 1175 1180 

ttg gcc gqt gqt gtq acc gtq atg gcc acg ccg gqg atg ttc gtq gag 3600 
Leu Ala Gly Gly val Thr Val Met Ala Thr Pro Gly Met Phe Val Glu 
1185 1190 1195 1200 

ttt teg egg cag cgt gqt ttg gcg ccg gat gqg egg tgc aag teg ttc 3648 
Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe 
1205 1210 1215 

gcg gag gcc gcc gac gqc acc gqc tgg tec gag gqt get gqc ctg gtt 3696 
Ala Glu Ala Ala Asp Gly Thr Gly Trp ser Glu Gly Ala Gly Leu Val 
1220 1225 1230 

eta ctg gag egg ttg teg gat gcc egg egg aat gqg cat gag gtq ctg 3744 
Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Glu val Leu 
1235 1240 ~ ~ 1245 

gcg gtt gtt cgt gqt agt gcg gtq aat cag gac gqt gcg teg aat gqt 3792 
Ala val val Arg Gly ser Ala Val Asn Gin Asp Gly Ala ser Asn Gly 
1250 " 1255 1260 

ttg acc gcg ccg aat gqt teg teg cag cag egg gtq att gcc cag gca 3840 
Leu Thr Ala Pro Asn Gly Ser Ser Gin Gin Arg Val lie Ala Gin Ala 
1265 1270 1275 1280 

ttg gcg agt gcg gqg ttg teg gtq tec gat gtq gat get gtq gag gcg 3888 
Leu Ala Ser Ala Gly Leu Ser val Ser Asp Val Asp Ala Val Glu Ala 
1285 1290 1295 

cat gqg acg gqc acg egg ctt gqt gat ccg ate gag gcg cag gcg ctg 3936 
His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu Ala Gin Ala Leu 
1300 1305 1310 

ate gcc acc tac gqc cag gqc egg ctt ccg gaa egg cca ttg tgg ttg 3984 
lie Ala Thr Tyr Gly Gin Gly Arg Leu Pro Glu Arg Pro Leu Trp Leu 
1315 1320 1325 

gqc teg atg aag teg aac ate gqt cac gcg cag gca get gcg gqg ata 4032 
Gly Ser Met Lys ser Asn lie Gly His Ala Gin Ala Ala Ala Gly lie 
1330 1335 1340 

gcc gqc gtc atg aag atg gtq atg gcg atg egg cac gqg cag eta ccg 4080 
Ala Gly val Met Lys Met val Met Ala Met Arg His Gly Gin Leu Pro 
1345 1350 1355 1360 

cgc acg ttg cac gtq gat gag ccg act tct gqg gtq gat tgg teg gcg 4128 
Arg Thr Leu His Val Asp Glu Pro Thr Ser Gly Val Asp Trp Ser Ala 
1365 1370 1375 

gqg acg gtt caa etc ctt acg gag aac acg ccc tgg ccc gqg agt gqt 4176 
Gly Thr Val Gin Leu Leu Thr Glu Asn Thr Pro Trp Pro Gly Ser Gly 
1380 1385 1390 

cgt gtt cgt egg gtq gqg gtq teg teg ttc gqg ate agt gqt act aac 4224 
Arg val Arg Arg val Gly val Ser Ser Phe Gly lie Ser Gly Thr Asn 
1395 " 1400 1405 

gcg cac gtc ate etc gaa cag ccc ccg gqa gtq ccg agt cag tct gcg 4272 
Ala His val lie Leu Glu Gin Pro Pro Gly val Pro ser Gin ser Ala 
1410 1415 1420 
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999 ccg gqt teg ggc tct gtc gtg gat gtt ccg gtg gtg ccg tgg atg 4320 
Gly Pro Gly Ser Gly ser val val Asp val Pro val val Pro Trp Met 
1425 1430 1435 1440 

gtq teg gqc aaa aca ccc gaa gcg eta tec gcg cag gca acg gcg ttg 4368 
val Ser Gly Lys Thr Pro Glu Ala Leu Ser Ala Gin Ala Thr Ala Leu 
1445 1450 1455 

atg acc tat ctg gac gag cga cct gat gtc tec teg ctg gat gtt ggg 4416 
Met Thr Tyr Leu Asp Glu Arg Pro Asp val Ser Ser Leu Asp Val Gly 
1460 ~ 1465 1470 

tac teg ctg gcg ttg aca egg teg gcg ctg gat gag cga gcg gtq gtq 4464 
Tyr Ser Leu Ala Leu Thr Arg Ser Ala Leu Asp Glu Arg Ala val val 
1475 1480 1485 

ctg ggg teg gac cgt gaa acg ttg ttg tgc ggt gtg aaa gcg ctg tct 4512 
Leu Gly ser Asp Arg Glu Thr Leu Leu Cys Gly val Lys Ala Leu Ser 
1490 1495 1500 

gec ggt cat gag get tct ggg ttg gtg acc gga tct gtg ggg get ggg 4560 
Ala Gly His Glu Ala Ser Gly Leu Val Thr Gly Ser Val Gly Ala Gly 
1505 1510 1515 1520 

gqc cgc ate ggg ttt gtg ttt tec ggt cag ggt ggt cag tgg ctg ggg 4608 
Gly Arg lie Gly Phe Val Phe Ser Gly Gin Gly Gly Gin Trp Leu Gly 
1525 1530 1535 

atg ggc egg ggg ctt tac egg get ttt ccg gtg ttc get get gee ttt 4656 
Met Gly Arg Gly Leu Tyr Arg Ala Phe Pro Val Phe Ala Ala Ala Phe 
1540 1545 1550 

gac gaa get tgt gee gag ctg gat gcg cat ctg ggc cag gaa ate ggg 4704 
Asp Glu Ala Cys Ala Glu Leu Asp Ala His Leu Gly Gin Glu lie Gly 
1555 1560 1565 

gtt egg gag gtg gtg tec ggt teg gat gcg cag ttg ctg gat egg acg 4752 
Val Arg Glu val val Ser Gly Ser Asp Ala Gin Leu Leu Asp Arg Thr 
1570 1575 1580 

ttg tgg gcg cag teg ggt ttg ttc gcg ttg cag gtg ggc ttg ctg aag 4800 
Leu Trp Ala Gin ser Gly Leu Phe Ala Leu Gin Val Gly Leu Leu Lys 
1585 1590 1595 1600 

ttg ctg gat teg tgg ggg gtt egg ccg agt gtg gtg ttg ggg cat teg 4848 
Leu Leu Asp Ser Trp Gly Val Arg Pro Ser val val Leu Gly His Ser 
1605 1610 1615 

gtg ggc gag ttg gcg gcg gcg ttc gcg gcg ggt gtg gtg teg ttg teg 4896 
val Gly Glu Leu Ala Ala Ala Phe Ala Ala Gly val val Ser Leu Ser 
1620 1625 1630 

ggt gcg get egg ttg gtg gcg ggt cgt gee egg ttg atg cag gcg ttg 4944 
Gly Ala Ala Arg Leu Val Ala Gly Arg Ala Arg Leu Met Gin Ala Leu 
1635 ~ 1640 " 1645 

ccg tct ggc ggt ggg atg ctg gcg gtg cct get ggt gag gag ctg ttg 4992 
Pro Ser Gly Gly Gly Met Leu Ala val Pro Ala Gly Glu Glu Leu Leu 
1650 ' 1655 1660 

tgg teg ttg ttg gec gat cag ggt gat cgt gtg ggg ate gee gcg gtc 5040 
Trp ser Leu Leu Ala Asp Gin Gly Asp Arg Val Gly lie Ala Ala val 
1665 1670 * 1675 1680 

Page 122 



LeA33955-US 

aac get gcg ggg teg gtg gtg etc tct ggt gat egg gat gtg etc gat 5088 
Asn Ala Ala Gly ser val val Leu Ser Gly Asp Arg Asp val Leu Asp 
1685 1690 1695 

gac ctt gee ggt egg ctg gac gqg caa gqg ate egg teg agg tgg ttg 5136 
Asp Leu Ala Gly Arg Leu Asp Gly Gin Gly lie Arg ser Arg Trp Leu 
1700 ~ 1705 1710 

egg gtq teg cat gcg ttt cat teg tat egg atg gat ccg atg ctg gcg 5184 
Arg val Ser His Ala Phe His Ser Tyr Arg Met Asp Pro Met Leu Ala 
1715 1720 ^ 1725 

gag ttc gec gaa ttg gca cga acc gtq gat tac egg cgt tgt gaa gtq 5232 
Glu Phe Ala Glu Leu Ala Arg Thr val Asp Tyr Arg Arg Cys Glu val 
1730 1735 1740 

ccg ate gtq teg acc ttg acc gga gac etc gat gac get ggc agg atg 5280 
pro He val ser Thr Leu Thr Gly Asp Leu Asp Asp Ala Gly Arg Met 
1745 1750 1755 1760 

age gqg ccc gac tac tgg gtq cgt cag gtq cga gag ccg gtc cgc ttc 5328 
Ser Gly Pro Asp Tyr Trp val Arg Gin Val Arg Glu Pro val Arg Phe 
1765 1770 1775 

gec gac ggt gtc cag gcg ctg gtc gag cac gat gtq gec acc gtt gtc 5376 
Ala Asp Gly val Gin Ala Leu val Glu His Asp Val Ala Thr val val 
1780 1785 1790 

gag etc ggt ccg gac ggg gcg ttg teg gcg ctg ate cag gaa tgt gtc 5424 
Glu Leu Gly Pro Asp Gly Ala Leu Ser Ala Leu lie Gin Glu Cys val 
1795 1800 1805 

gec gca tec gat cac gec ggg egg ctg age gcg gtc ccg gcg atg cgc 5472 
Ala Ala ser Asp His Ala Gly Arg Leu ser Ala val Pro Ala Met Arg 
1810 1815 1820 

agg aac cag gac gag gcg cag aag gtq atg acg gec ctg gca cac gtc 5520 
Arg Asn Gin Asp Glu Ala Gin Lys val Met Thr Ala Leu Ala His Val 
1825 1830 1835 1840 

cac gta cgt ggt ggt gcg gtg gac tgg egg teg ttc ttc gec ggt aca 5568 
His val Arg Gly Gly Ala val Asp Trp Arg ser Phe Phe Ala Gly Thr 
1845 1850 1855 

agg gcg aag caa ate gag ctg ccc acc tac gec ttc caa cga cag egg 5616 
Arg Ala Lys Gin lie Glu Leu Pro Thr Tyr Ala Phe Gin Arg Gin Arg 
1860 1865 1870 

tac tgg ctg aac gcg ctg cgt gaa tct tec gee ggc gac atg ggc agg 5664 
Tyr Trp Leu Asn Ala Leu Arg Glu Ser Ser Ala Gly Asp Met Gly Arg 
1875 1880 1885 

cgt gtc gaa gcg aag ttc tgg ggc gec gtc gag cac gaa gat gtq gaa 5712 
Arg val Glu Ala Lys Phe Trp Gly Ala val Glu His Glu Asp val Glu 
1890 1895 1900 

teg ctt gca cgc gta ttg ggc att gtg gac gac ggc get get gtq gat 5760 
Ser Leu Ala Arg Val Leu Gly lie Val Asp Asp Gly Ala Ala val Asp 
1905 1910 1915 1920 

tec ctg aga age gee ctt ccg gtg ttg gec ggt tgg cag cga acc cgc 5808 
Ser Leu Arg ser Ala Leu Pro Val Leu Ala Gly Trp Gin Arg Thr Arg 

Page 123 



LeA33955-US 

1925 1930 1935 

acc acc gag tec att atg gat cag egg tgt tac cga att gqc tgg egg 5856 
Thr Thr Glu Ser lie Met Asp Gin Arg Cys Tyr Arg lie Gly Trp Arg 
1940 1945 ~ 1950 

cag gta gec gqa etc ccg ccg atg gqa act gtt ttc gqt acc tgg ctg 5904 
Gin val Ala Gly Leu Pro Pro Met Gly Thr Val Phe Gly Thr Trp Leu 
1955 1960 1965 

gtc ttc gcg cct cat gqc tgg tec age gaa ccg gag gtq gtq gac tgc 5952 
val Phe Ala Pro His Gly Trp ser Ser Glu Pro Glu val val Asp Cys 
1970 1975 1980 

gtt acg gca ctg egg gca cgt gqt gee teg gtq gtq ttg gtq gaa get 6000 
val Thr Ala Leu Arg Ala Arg Gly Ala Ser val val Leu Val Glu Ala 
1985 1990 1995 2000 

gat ccc gac ccg acc tec ttc gqc gac egg gta cga acc ctg tgt teg 6048 
Asp Pro Asp Pro Thr Ser Phe Gly Asp Arg val Arg Thr Leu Cys Ser 
2005 2010 2015 

gqc ctt ccg gat ctt gtt gqc gtq ttg tea atg ttg tgc ttg gaa gaa 6096 
Gly Leu Pro Asp Leu Val Gly Val Leu Ser Met Leu Cys Leu Glu Glu 
2020 2025 2030 

teg gtc ctt ccg gqa ttt tct gcg gtq tea egg gqt ttt gcg ttg acc 6144 
Ser val Leu Pro Gly Phe Ser Ala val Ser Arg Gly Phe Ala Leu Thr 
2035 2040 2045 

gtq gag ttg gtq egg gtt ttg egg gca get gqt gcg act gee egg ttg 6192 
Val Glu Leu val Arg val Leu Arg Ala Ala Gly Ala Thr Ala Arg Leu 
2050 ~ 2055 2060 

tgg ttg ctg acg tgt gqt gqc gtq teg gtq gqa gat gta ccg gtt cgt 6240 
Trp Leu Leu Thr Cys Gly Gly Val Ser Val Gly Asp Val Pro Val Arg 
2065 2070 2075 2080 

cca gcg cag gee ctg gcg tgg gqg ttg gqg cgt gtt gtq gqg ttg gag 6288 
Pro Ala Gin Ala Leu Ala Trp Gly Leu Gly Arg Val val Gly Leu Glu 
2085 2090 " 2095 

cat ccg gac tgg tgg gqc gqc ttg ate gat att ccg gtc ttg ttc gac 6336 
His Pro Asp Trp Trp Gly Gly Leu lie Asp He Pro Val Leu Phe Asp 
2100 2105 2110 

gaa gac get caa gag egg ttg teg att gtq ctg gca gqt etc gat gag 6384 
Glu Asp Ala Gin Glu Arg Leu Ser lie Val Leu Ala Gly Leu Asp Glu 
2115 2120 2125 

gac gag gtc gcg ate cgt cct gac gqc atg ttc gcg cgt egg ttg gta 6432 
Asp Glu val Ala lie Arg Pro Asp Gly Met Phe Ala Arg Arg Leu val 
2130 2135 2140 

cgc cac act gtc tea get gat gtq aag aag gcg tgg cgc ccc agg gqa 6480 
Arg His Thr Val Ser Ala Asp val Lys Lys Ala Trp Arg Pro Arg Gly 
2145 2150 2155 2160 

teg gtq ctg gtq acg gqc gqc acg gqt gqt ttg gqg gcg cac gtt get 6528 
Ser val Leu val Thr Gly Gly Thr Gly Gly Leu Gly Ala His val Ala 
2165 2170 ' 2175 

cgc tgg ctg gee gac gee gga gee gaa cat gtg gcg atg gtg agt cga 6576 
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Arg Trp Leu Ala Asp Ala Gly Ala Glu His val Ala Met Val Ser Arg 
2180 2185 2190 

cgc gqc gag cag gca ccg agt get gag aag ttg egg acg gaa ctg gag 6624 
Arg Gly Glu Gin Ala Pro Ser Ala Glu Lys Leu Arg Thr Glu Leu Glu 
2195 2200 2205 



gat ctg gqt acc egg gtq teg ate gtq tea tgc gat gtq ace gat cgc 
Asp Leu Gly Thr Arg Val Ser lie Val Ser Cys Asp Val Thr Asp Arg 
2210 2215 2220 



6672 



gag gcg etc gee gaa gtq ctg aaa gee ctt ccg get gaa aac ccg ttg 6720 

Glu Ala Leu Ala Glu val Leu Lys Ala Leu Pro Ala Glu Asn Pro Leu 
2225 2230 2235 2240 

acc gcg gta gtq cat gcg gca gqc gtq ate gag act gqt gat gcg gcg 6768 

Thr Ala val Val His Ala Ala Gly Val lie Glu Thr Gly Asp Ala Ala 
2245 2250 2255 

gca atg age ctg get gat ttc gat cac gtq ttg tec gca aag gtq gee 6816 

Ala Met ser Leu Ala Asp Phe Asp His vat Leu ser Ala Lys vat Ala 
2260 2265 2270 

gqt gee gcg aat ctg gat gee ttg ttg gee gat gtq gaa ttg gac gcg 6864 

Gly Ala Ala Asn Leu Asp Ala Leu Leu Ala Asp val Glu Leu Asp Ala 
2275 2280 2285 

ttc gtc ttg ttc tea teg gtq tea gqa gtt tgg gqc get gqg gqa cac 6912 

Phe val Leu Phe Ser Ser Val Ser Gly val Trp Gly Ala Gly Gly His 
2290 2295 2300 

gqg get tac gca gcg gcg aat gec tat ctg gat gcg etc gcg gaa cag 6960 

Gly Ala Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Ala Leu Ala Glu Gin 
2305 2310 2315 2320 

cgt egg teg cga gqg ctg gtc gcg act gcg gtq gec tgg gqg ccg tgg 7008 

Arg Arg Ser Arg Gly Leu val Ala Thr Ala val Ala Trp Gly Pro Trp 
2325 2330 2335 

gec gqc gag gqc atg gee tec gqa gaa aca gqa gac cag ctg cgc cga 

Ala Gly Glu Gly Met Ala Ser Gly Glu Thr Gly Asp Gin Leu Arg Arg 
2340 2345 2350 

tac gqc ctt tec cca atg get ccg cag cac gee ate gec gqa ate egg 

Tyr Gly Leu ser Pro Met Ala Pro Gin His Ala He Ala Gly lie Arg 
2355 2360 2365 

cag gee gtq gaa cag gac gaa att tec ctg gta gtq gee gat gtc gat 

Gin Ala Val Glu Gin Asp Glu lie ser Leu Val Val Ala Asp Val Asp 
2370 2375 2380 



7056 



7104 



7152 



tgg gca cgt ttc age gcg gqa ttg ctg gcg get agg ccg egg ccg ctg 7200 

Trp Ala Arg Phe Ser Ala Gly Leu Leu Ala Ala Arg Pro Arg Pro Leu 
2385 2390 2395 " " 2400 

ctg aac gaa ctg gee gag gtc aag gaa etc etc gtc gat gec cag ccc 7248 

Leu Asn Glu Leu Ala Glu Val Lys Glu Leu Leu val Asp Ala Gin Pro 
2405 2410 2415 

gag gcg gqa gtc ctt gec gac gcg teg ttg gaa tgg egg cag cga ttg 7296 

Glu Ala Gly Val Leu Ala Asp Ala Ser Leu Glu Trp Arg Gin Arg Leu 
2420 2425 2430 
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tec gcg gca ccg agg ccg aca cag gaa cag ctg ate ctg gag ctg gta 7344 
Ser Ala Ala Pro Arg Pro Thr Gin Glu Gin Leu lie Leu Glu Leu val 
2435 " 2440 2445 



cgc gqc gaa acc get ctg gtq ctg gqa cac ccc gqg gca gcg gec gtt 
Arg Gly Glu Thr Ala Leu Val Leu Gly His Pro Gly Ala Ala Ala val 
2450 2455 2460 



ctg egg gcg agg ctt ctt ccc gag tec gca gqa gca gqc att ccc gag 
Leu Arg Ala Arg Leu Leu Pro Glu Ser Ala Gly Ala Gly lie Pro Glu 
2515 2520 2525 



7392 



gca teg gaa cga gee ttc aag gac age gqa ttc gac teg cag gee gcg 7440 

Ala Ser Glu Arg Ala Phe Lys Asp Ser Gly Phe Asp Ser Gin Ala Ala 
2465 2470 2475 2480 

gtc gaa etc cgc gtt egg etc aat cga get acc gqc etc cag ttg cca 7488 

val Glu Leu Arg val Arg Leu Asn Arg Ala Thr Gly Leu Gin Leu Pro 
2485 ~ 2490 2495 

teg aca att ate ttc age cat ccc acg cct gcg gaa ctg get gcg gag 7536 

Ser Thr lie lie Phe Ser His Pro Thr Pro Ala Glu Leu Ala Ala Glu 
2500 2505 2510 



7584 



gag gac gag gcg cga ate aga gcg gca ctg acg teg ate ccg ttc ccg 7632 
Glu Asp Glu Ala Arg lie Arg Ala Ala Leu Thr Ser lie Pro Phe Pro 
2530 2535 2540 

gee ttg cgc gag gca gqc ttg gtq agt ccg ctg etc gca ctt gee gqa 7680 
Ala Leu Arg Glu Ala Gly Leu val Ser Pro Leu Leu Ala Leu Ala Gly 
2545 2550 2555 2560 

cac ccg gtc gac tec gqt ate tec teg gac gat gcg gee gcg acc teg 7728 
His Pro Val Asp Ser Gly lie Ser Ser Asp Asp Ala Ala Ala Thr Ser 
2565 2570 2575 

ate gat gcg atg gat gta gee gqc etc gtc gaa gca gcg ctg gqc gaa 7776 
lie Asp Ala Met Asp Val Ala Gly Leu Val Glu Ala Ala Leu Gly Glu 
2580 2585 2590 

cgc gag tec tga 7788 
Arg Glu Ser 
2595 

<210> 42 
<211> 2595 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 42 

Met Ser Glu Ala Gly Asn Leu lie Ala val lie Gly Leu Ser Cys Arg 
1 5 10 15 

Leu Pro Gin Ala Pro Asp Pro Ala Ser Phe Trp Arg Leu Leu Arg Thr 
20 25 30 

Gly Thr Asp Ala lie Thr Thr Val Pro Glu Gly Arg Trp Gly Asp Pro 
35 40 45 

Leu Pro Gly Arg Asp Ala Pro Lys Gly Pro Glu Trp Gly Gly Phe Leu 
50 55 60 
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Ala Asp val Asp cys Phe Asp Pro Glu Phe Phe Gly lie ser Pro Ar 
65 70 75 8 

Glu Ala Ala Ala val Asp Pro Gin Gin Arg Leu Ala Leu Glu Leu Ala 
85 90 95 

Trp Glu Ala Leu Glu Asp Ala Gly lie Pro Ala Gly Glu Leu Arg Gly 
100 105 110 

Thr Ala Ala Gly val Phe Met Gly Ala lie Ser Asp Asp Tyr Ala Ala 
115 120 125 

Leu Leu Arg Glu Ser Pro Pro Glu Val Ala Ala Gin Tyr Arg Leu Thr 
130 135 140 

Gly Thr His Arg ser Leu lie Ala Asn Arg val ser Tyr val Leu Gly 
145 " 150 155 160 

Leu Arg Gly Pro ser Leu Thr val Asp Ser Gly Gin Ser Ser ser Leu 
165 170 " 175 

val Gly val His Leu Ala ser Glu Ser Leu Arg Arg Gly Glu Cys Thr 
180 185 ~ " 190 

lie Ala Leu Ala Gly Gly val Asn Leu Asn Leu Ala Ala Glu Ser Asn 
195 200 205 

Ser Ala Leu Met Asp Phe Gly Ala Leu Ser Pro Asp Gly Arg Cys Phe 
210 215 220 

Thr Phe Asp val Arg Ala Asn Gly Tyr val Arg Gly Glu Gly Gly Gly 
225 230 235 240 

Leu Val val Leu Lys Lys Ala Asp Gin Ala His Ala Asp Gly Asp Arg 
245 250 255 

lie Tyr Cys Leu lie Arg Gly Ser Ala Val Asn Asn Asp Gly Gly Gly 
260 265 270 

Ala Gly Leu Thr Val Pro Ala Ala Asp Ala Gin Ala Glu Leu Leu Arg 
275 280 285 

Gin Ala Tyr Arg Asn Ala Gly val Asp Pro Ala Ala val Gin Tyr Val 
290 " 295 300 

Glu Leu His Gly Ser Ala Thr Arg Val Gly Asp Pro Val Glu Ala Ala 
305 310 ' 315 320 

Ala Leu Gly Ala Val Leu Gly Ala Ala Arg Arg Pro Gly Asp Glu Leu 
325 330 335 

Arg val Gly ser Ala Lys Thr Asn val Gly His Leu Glu Ala Ala Ala 
340 345 350 

Gly val Thr Gly Leu Leu Lys Thr Ala Leu Ser lie Trp His Arg Glu 
355 360 365 

Leu Pro Pro Ser Leu His Phe Thr Ala Pro Asn Pro Glu lie Pro Leu 
370 375 380 

Asp Glu Leu Asn Leu Arg val Gin Arg Asp Leu Arg Pro Trp Pro Glu 
385 390 395 " 400 
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ser Glu Gly Pro Leu Leu Ala Gly Val Ser Ala Phe Gly Met Gly Gly 
405 410 415 

Thr Asn Cys His Leu Val Leu Ser Gly Thr Ser Arg Val Glu Arg Arg 
420 425 ~ 430 

Arg Ser Gly Pro Ala Glu Ala Thr Met Pro Trp val Leu Ser Ala Arg 
435 440 445 

Thr Pro Val Ala Leu Arg Ala Gin Ala Ala Arg Leu His Thr His Leu 
450 455 460 

Asn Thr Ala Gly Gin ser Pro Leu Asp val Ala Tyr ser Leu Ala Thr 
465 470 475 480 

Thr Arg Ser Ala Leu Pro His Arg Ala Ala Leu Val Ala Asp Asp Glu 
485 490 495 

Pro Lys Leu Leu Ala Gly Leu Lys Ala Leu Ala Asp Gly Asp Asp Ala 
500 505 510 

Pro Thr Leu Cys His Gly Ala Thr Ser Gly Glu Arg Ala Ala val Phe 
515 520 525 

val Phe Pro Gly Gin Gly Ser Gin Trp lie Gly Met Gly Arg Gin Leu 
530 535 540 

Leu Glu Thr Ser Glu val Phe Ala Ala Ser Met Ser Asp Cys Ala Asp 
545 550 555 560 

Ala Leu Ala Pro His Leu Asp Trp Ser Leu Leu Asp val Leu Arg Asn 
565 570 575 

Ala Ala Gly Ala Ala His Leu Asp His Asp Asp val Val Gin Pro Ala 
580 585 590 

Leu Phe Ala lie Met val Ser Leu Ala Glu Leu Trp Arg ser Trp Gly 
595 600 605 

val Arg Pro Val Ala Val Val Gly His Ser Gin Gly Glu lie Ala Ala 
610 615 620 

Ala Cys val Ala Gly Ala Leu ser val Arg Asp Ala Ala Arg val val 
625 630 635 640 

Ala Val Arg ser Arg Leu Leu Thr Ala Leu Ala Gly Ser Gly Ala Met 
645 650 655 

Ala Ser Leu Gin His Pro Ala Glu Glu Val Arg Gin lie Leu Leu Pro 
660 665 670 

Trp Arg Asp Arg lie Gly val Ala Gly val Asn Gly Pro Ser Ser Thr 
675 680 685 

Leu val Ser Gly Asp Arg Glu Ala Met Ala Glu Leu Leu Ala Glu Cys 
690 695 700 

Ala Asp Arg Glu Leu Arg Met Arg Arg lie Pro val Glu Tyr Ala Ser 
705 710 715 720 

His ser Pro His lie Glu val val Arg Asp Glu Leu Leu Gly Leu Leu 
725 730 735 
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Ala Pro val Glu Pro Arg Thr Gly Ser lie Pro lie Tyr ser Thr Thr 
740 ~ 745 750 

Thr Gly Asp Leu Leu Asp Arg Pro Met Asp Ala Asp Tyr Trp Tyr Arg 
755 760 765 

Asn Leu Arg Gin Pro val Leu Phe Glu Ala Ala Val Glu Ala Leu Leu 
770 775 780 

Lys Arg Gly Tyr Asp Ala Phe lie Glu lie Ser Pro His Pro val Leu 
785 790 795 800 

Thr Ala Asn He Gin Glu Thr Ala val Arg Ala Gly Arg Glu Val Val 
805 810 815 

Ala Leu Gly Thr Leu Arg Arg Gly Glu Gly Gly Met Arg Gin Ala Leu 
820 " 825 830 

Thr Ser Leu Ala Arg Ala His Val His Gly val Ala Ala Asp Trp His 
835 840 845 

Ala val Phe Ala Gly Thr Gly Ala Gin Arg Val Asp Leu Pro Thr Tyr 
850 855 860 

Ala Phe Gin Arg Gin Arg Tyr Trp Leu Asp Ala Lys Leu Pro Asp val 
865 870 875 880 

Ala Met Pro Glu Ser Asp Val Ser Thr Ala Leu Arg Glu Lys Leu Arg 
885 890 ~ 895 

Ser Ser Pro Arg Ala Asp Val Asp Ser Thr Thr Leu Thr Met lie Arg 
900 905 910 

Ala Gin Ala Ala val val Leu Gly His Ser Asp Pro Lys Glu val Asp 
915 920 925 

Pro Asp Arg Thr Phe Lys Asp Leu Gly Phe Asp Ser Ser Met val val 
930 935 940 

Glu Leu cys Asp Arg Leu Asn Ala Ala Thr Gly Leu Arg Leu Ala Pro 
945 950 955 960 

Ser val val Phe Asp Cys Pro Thr Pro Asp Lys Leu Ala Arg Gin val 
965 970 975 

Arg Thr Leu Leu Leu Gly Glu Pro Ala Pro Met Thr Ser His Arg Pro 
980 985 990 

Asp ser Asp Ala Asp Glu Pro lie Ala val lie Gly Met Gly Cys Arg 
995 1000 1005 

Phe Pro Gly Gly Val Ser Ser Pro Glu Glu Leu Trp Gin Leu Val Ala 
1010 1015 1020 

Ala Gly Arg Asp Val val Ser Glu Phe Pro Ala Asp Arg Gly Trp Asp 
025 1030 1035 " 1040 

Leu Glu Arg Ala Gly Thr Ser His val Arg Ala Gly Gly Phe Leu His 
1045 1050 1055 

Gly Ala Pro Asp Phe Asp Pro Gly Phe Phe Arg lie Ser Pro Arg Glu 
1060 1065 1070 
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Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu lie Ala Trp 
1075 1080 1085 

Glu Ala val Glu Arg Gly Gly lie Asn Pro Gin His Leu His Gly Ser 
1090 1095 1100 

Gin Thr Gly val Phe val Gly Ala Thr ser Leu Asp Tyr Gly Pro Arg 
105 1110 1115 1120 

Leu His Glu Ala Ser Glu Glu Ala Ala Gly Tyr Val Leu Thr Gly Ser 
1125 1130 1135 

Thr Thr ser Val Ala Ser Gly Arg val Ala Tyr Ser Phe Gly Phe Glu 
1140 1145 1150 

Gly Pro Ala val Thr val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala 
1155 1160 1165 

Leu His Leu Ala cys Gin Ser Leu Arg ser Gly Glu Cys Asp Leu Ala 
1170 1175 1180 

Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Gly Met Phe Val Glu 
185 1190 1195 1200 

Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe 
1205 1210 1215 

Ala Glu Ala Ala Asp Gly Thr Gly Trp Ser Glu Gly Ala Gly Leu val 
1220 1225 1230 

Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Glu val Leu 
1235 1240 ~ 1245 

Ala val val Arg Gly ser Ala val Asn Gin Asp Gly Ala Ser Asn Gly 
1250 1255 1260 

Leu Thr Ala Pro Asn Gly Ser Ser Gin Gin Arg Val lie Ala Gin Ala 
265 1270 1275 1280 

Leu Ala Ser Ala Gly Leu Ser val Ser Asp val Asp Ala val Glu Ala 
1285 1290 1295 

His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu Ala Gin Ala Leu 
1300 ~ 1305 1310 

lie Ala Thr Tyr Gly Gin Gly Arg Leu Pro Glu Arg Pro Leu Trp Leu 
1315 1320 1325 

Gly Ser Met Lys Ser Asn lie Gly His Ala Gin Ala Ala Ala Gly lie 
1330 1335 1340 

Ala Gly Val Met Lys Met val Met Ala Met Arg His Gly Gin Leu Pro 
345 1350 1355 1360 

Arg Thr Leu His val Asp Glu Pro Thr Ser Gly Val Asp Trp Ser Ala 
1365 1370 1375 

Gly Thr val Gin Leu Leu Thr Glu Asn Thr Pro Trp Pro Gly Ser Gly 
1380 1385 1390 

Arg val Arg Arg val Gly val Ser Ser Phe Gly lie Ser Gly Thr Asn 
1395 1400 1405 
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Ala His val lie Leu Glu Gin Pro Pro Gly val Pro Ser Gin Ser Ala 
1410 1415 1420 

Gly Pro Gly ser Gly Ser val Val Asp Val Pro Val Val Pro Trp Met 
425 1430 1435 1440 

Val Ser Gly Lys Thr Pro Glu Ala Leu Ser Ala Gin Ala Thr Ala Leu 
1445 1450 1455 

Met Thr Tyr Leu Asp Glu Arg Pro Asp val ser Ser Leu Asp val Gly 
1460 1465 1470 

Tyr Ser Leu Ala Leu Thr Arg ser Ala Leu Asp Glu Arg Ala Val Val 
1475 1480 1485 

Leu Gly Ser Asp Arg Glu Thr Leu Leu Cys Gly Val Lys Ala Leu ser 
1490 1495 1500 

Ala Gly His Glu Ala Ser Gly Leu Val Thr Gly ser val Gly Ala Gly 
505 1510 1515 1520 

Gly Arg lie Gly Phe val Phe Ser Gly Gin Gly Gly Gin Trp Leu Gly 
1525 1530 * 1535 

Met Gly Arg Gly Leu Tyr Arg Ala Phe Pro Val Phe Ala Ala Ala Phe 
1540 1545 1550 

Asp Glu Ala Cys Ala Glu Leu Asp Ala His Leu Gly Gin Glu lie Gly 
1555 1560 1565 

Val Arg Glu Val val Ser Gly Ser Asp Ala Gin Leu Leu Asp Arg Thr 
1570 1575 1580 

Leu Trp Ala Gin Ser Gly Leu Phe Ala Leu Gin Val Gly Leu Leu Lys 
585 1590 1595 1600 

Leu Leu Asp Ser Trp Gly Val Arg Pro Ser val val Leu Gly His Ser 
1605 1610 1615 

val Gly Glu Leu Ala Ala Ala Phe Ala Ala Gly Val Val Ser Leu Ser 
1620 1625 1630 

Gly Ala Ala Arg Leu Val Ala Gly Arg Ala Arg Leu Met Gin Ala Leu 
1635 1640 1645 

Pro Ser Gly Gly Gly Met Leu Ala Val Pro Ala Gly Glu Glu Leu Leu 
1650 1655 1660 

Trp Ser Leu Leu Ala Asp Gin Gly Asp Arg Val Gly lie Ala Ala Val 
665 1670 1675 1680 

Asn Ala Ala Gly Ser val val Leu Ser Gly Asp Arg Asp val Leu Asp 
1685 1690 1695 

Asp Leu Ala Gly Arg Leu Asp Gly Gin Gly lie Arg Ser Arg Trp Leu 
1700 1705 1710 

Arg val Ser His Ala Phe His ser Tyr Arg Met Asp Pro Met Leu Ala 
1715 1720 1725 

Glu Phe Ala Glu Leu Ala Arg Thr val Asp Tyr Arg Arg Cys Glu val 
1730 1735 1740 
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Pro lie Val ser Thr Leu Thr Gly Asp Leu Asp Asp Ala Gly Arg Met 
745 1750 1755 1760 

Ser Gly Pro Asp Tyr Trp val Arg Gin val Arg Glu Pro Val Arg Phe 
1765 " 1770 1775 

Ala Asp Gly Val Gin Ala Leu Val Glu His Asp Val Ala Thr Val Val 
1780 1785 1790 

Glu Leu Gly Pro Asp Gly Ala Leu Ser Ala Leu lie Gin Glu Cys val 
1795 1800 1805 

Ala Ala ser Asp His Ala Gly Arg Leu Ser Ala val Pro Ala Met Arg 
1810 1815 1820 

Arg Asn Gin Asp Glu Ala Gin Lys val Met Thr Ala Leu Ala His val 
825 1830 1835 1840 

His Val Arg Gly Gly Ala Val Asp Trp Arg Ser Phe Phe Ala Gly Thr 
1845 1850 1855 

Arg Ala Lys Gin lie Glu Leu Pro Thr Tyr Ala Phe Gin Arg Gin Arg 
1860 1865 1870 

Tyr Trp Leu Asn Ala Leu Arg Glu Ser ser Ala Gly Asp Met Gly Arg 
1875 1880 1885 

Arg Val Glu Ala Lys Phe Trp Gly Ala Val Glu His Glu Asp val Glu 
1890 1895 1900 

Ser Leu Ala Arg Val Leu Gly lie val Asp Asp Gly Ala Ala val Asp 
905 " 1910 1915 1920 

Ser Leu Arg Ser Ala Leu Pro Val Leu Ala Gly Trp Gin Arg Thr Arg 
1925 1930 1935 

Thr Thr Glu Ser lie Met Asp Gin Arg Cys Tyr Arg lie Gly Trp Arg 
1940 1945 " 1950 

Gin val Ala Gly Leu Pro Pro Met Gly Thr val Phe Gly Thr Trp Leu 
1955 1960 1965 

Val Phe Ala Pro His Gly Trp Ser ser Glu Pro Glu val val Asp Cys 
1970 1975 1980 

Val Thr Ala Leu Arg Ala Arg Gly Ala Ser Val Val Leu val Glu Ala 
985 1990 1995 2000 

Asp Pro Asp Pro Thr ser Phe Gly Asp Arg val Arg Thr Leu Cys ser 
2005 2010 2015 

Gly Leu Pro Asp Leu val Gly val Leu Ser Met Leu Cys Leu Glu Glu 
2020 2025 2030 

Ser Val Leu Pro Gly Phe Ser Ala val ser Arg Gly Phe Ala Leu Thr 
2035 2040 ~ 2045 

val Glu Leu Val Arg val Leu Arg Ala Ala Gly Ala Thr Ala Arg Leu 
2050 2055 2060 

Trp Leu Leu Thr Cys Gly Gly val Ser val Gly Asp val Pro val Arg 
065 2070 2075 2080 
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Pro Ala Gin Ala Leu Ala Trp Gly Leu Gly Arg val val Gly Leu Glu 
2085 2090 2095 

His Pro Asp Trp Trp Gly Gly Leu lie Asp lie Pro Val Leu Phe Asp 
2100 2105 2110 

Glu Asp Ala Gin Glu Arg Leu Ser lie val Leu Ala Gly Leu Asp Glu 
2115 2120 2125 

Asp Glu val Ala lie Arg Pro Asp Gly Met Phe Ala Arg Arg Leu val 
2130 2135 2140 

Arg His Thr val ser Ala Asp val Lys Lys Ala Trp Arg Pro Arg Gly 
145 2150 2155 2160 

Ser Val Leu val Thr Gly Gly Thr Gly Gly Leu Gly Ala His val Ala 
2165 2170 2175 

Arg Trp Leu Ala Asp Ala Gly Ala Glu His val Ala Met val Ser Arg 
2180 2185 2190 

Arg Gly Glu Gin Ala Pro Ser Ala Glu Lys Leu Arg Thr Glu Leu Glu 
2195 2200 2205 

Asp Leu Gly Thr Arg val Ser lie val Ser Cys Asp Val Thr Asp Arg 
2210 2215 2220 

Glu Ala Leu Ala Glu val Leu Lys Ala Leu Pro Ala Glu Asn Pro Leu 
225 2230 2235 2240 

Thr Ala val Val His Ala Ala Gly Val lie Glu Thr Gly Asp Ala Ala 
2245 2250 2255 

Ala Met ser Leu Ala Asp Phe Asp His val Leu Ser Ala Lys val Ala 
2260 2265 2270 

Gly Ala Ala Asn Leu Asp Ala Leu Leu Ala Asp val Glu Leu Asp Ala 
2275 2280 2285 

Phe Val Leu Phe Ser ser val Ser Gly Val Trp Gly Ala Gly Gly His 
2290 2295 2300 

Gly Ala Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Ala Leu Ala Glu Gin 
305 2310 2315 2320 

Arg Arg Ser Arg Gly Leu Val Ala Thr Ala Val Ala Trp Gly Pro Trp 
2325 2330 2335 

Ala Gly Glu Gly Met Ala Ser Gly Glu Thr Gly Asp Gin Leu Arg Arg 
2340 2345 * 2350 

Tyr Gly Leu Ser Pro Met Ala Pro Gin His Ala lie Ala Gly lie Arg 
2355 2360 2365 

Gin Ala val Glu Gin Asp Glu lie ser Leu val val Ala Asp val Asp 
2370 2375 2380 

Trp Ala Arg Phe Ser Ala Gly Leu Leu Ala Ala Arg Pro Arg Pro Leu 
385 2390 2395 ~ 2400 

Leu Asn Glu Leu Ala Glu val Lys Glu Leu Leu Val Asp Ala Gin Pro 
2405 2410 2415 
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Glu Ala Gly val Leu Ala Asp Ala Ser Leu Glu Trp Arg Gin Arg Leu 
2420 2425 2430 

ser Ala Ala Pro Arg Pro Thr Gin Glu Gin Leu lie Leu Glu Leu val 
2435 ~ 2440 2445 

Arg Gly Glu Thr Ala Leu Val Leu Gly His Pro Gly Ala Ala Ala Val 
2450 2455 2460 

Ala Ser Glu Arg Ala Phe Lys Asp Ser Gly Phe Asp Ser Gin Ala Ala 
465 ~ 2470 2475 2480 

Val Glu Leu Arg val Arg Leu Asn Arg Ala Thr Gly Leu Gin Leu Pro 
2485 2490 2495 

Ser Thr lie lie Phe Ser His Pro Thr Pro Ala Glu Leu Ala Ala Glu 
2500 2505 2510 

Leu Arg Ala Arg Leu Leu Pro Glu Ser Ala Gly Ala Gly He Pro Glu 
2515 2520 2525 

Glu Asp Glu Ala Arg He Arg Ala Ala Leu Thr Ser lie Pro Phe Pro 
2530 ~ 2535 2540 

Ala Leu Arg Glu Ala Gly Leu Val Ser Pro Leu Leu Ala Leu Ala Gly 
545 2550 2555 2560 

His Pro val Asp Ser Gly lie Ser ser Asp Asp Ala Ala Ala Thr Ser 
2565 2570 2575 

lie Asp Ala Met Asp val Ala Gly Leu val Glu Ala Ala Leu Gly Glu 
2580 2585 2590 

Arg Glu Ser 
2595 



<210> 43 
<211> 6459 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1).. (6456) 

<223> ORF19; polyketide synthase 

<400> 43 

atg acg gtq acc acc agt tac gaa gaa gtt gtc gag gca ctg cga gca 48 

Met Thr val Thr Thr Ser Tyr Glu Glu Val Val Glu Ala Leu Arg Ala 
1 5 10 15 

teg etc aag gag aac gaa cgc etc egg cgc gqc agg gat egg ttc tec 96 
Ser Leu Lys Glu Asn Glu Arg Leu Arg Arg Gly Arg Asp Arg Phe Ser 
20 25 ^ 30 

gcg gag aag gac gat ccc ate gcg ate gtq gcg atg agt tgt cgt tat 144 
Ala Glu Lys Asp Asp Pro lie Ala lie Val Ala Met Ser Cys Arg Tyr 
35 40 45 

ccc ggt cag gtc tec teg ccg gag gac ctg tgg caa ctg get gee gqc 192 
Pro Gly Gin val Ser Ser Pro Glu Asp Leu Trp Gin Leu Ala Ala Gly 
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50 55 60 

ggt gtg gac gcg ate tec gaa gtt ccg ggg gat cgc gga tgg gac ctg 240 
~ly val Asp Ala lie Ser Glu Val Pro Gly Asp Arg Gl\ 



Gly val Asp Ala lie Ser Glu Val Pro Gly Asp Arg Gly Trp Asp Leu 

65 70 75 ^ 80 

gat ggc gtq ttc gtt ccg gac tec gat cgt cct ggc acg teg tat gec 

Asp Gly val Phe Val Pro Asp Ser Asp Arg Pro Gly Thr Ser Tyr Ala 

85 90 95 



ggt cat gtg ctg acg ggc ggt gcg gca gcg gtt ctt teg ggc egg gtt 
Gly His Val Leu Thr Gly Gly Ala Ala Ala Val Leu Ser Gly Arg Val 
180 " 185 190 

gcg tat teg ttc ggg ttc gag ggt cct gcg gtg acg gtg gat acg gcg 
Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala val Thr Val Asp Thr Ala 
195 200 205 



gat ggg egg tgc aag teg ttc gcg gcg ggt gcg gat ggc act gga tgg 
Asp Gly Arg Cys Lys Ser Phe Ala Ala Gly Ala Asp Gly Thr Gly Trp 
260 265 270 



288 



tgc gcg ggc ggt ttt ctt cag ggc gtg teg gag ttc gac gcg ggt ttc 336 

Cys Ala Gly Gly Phe Leu Gin Gly val Ser Glu Phe Asp Ala Gly Phe 

100 105 110 

ttc ggg att teg ccg cgt gag gcg ctg gcg atg gat ccg cag cag egg 384 

Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
115 120 125 

ttg ctg ctg gaa gtc gcg tgg gag gtc ttc gag egg get ggg ctg gag 432 

Leu Leu Leu Glu Val Ala Trp Glu Val Phe Glu Arg Ala Gly Leu Glu 
130 135 140 

cag egg teg aca cgc ggt tec cgc gtt ggc gtg ttc gtc ggc ace aat 480 

Gin Arg Ser Thr Arg Gly Ser Arg val Gly val Phe val Gly Thr Asn 

145 " 150 " 155 160 

ggc cag gac tac gcg teg tgg ttg egg acg ccg ccg cct gcg gtg gca 528 

Gly Gin Asp Tyr Ala Ser Trp Leu Arg Thr Pro Pro Pro Ala val Ala 
165 170 175 



576 



624 



tgt teg teg teg ttg gtg gcg ttg cac ctg gcg ggg caa gca ctg egg 672 

cys ser Ser Ser Leu val Ala Leu His Leu Ala Gly Gin Ala Leu Arg 
210 215 220 

gee ggt gag tgc gac ctt gee ctt gec ggt ggc gtc acg gtg atg teg 720 

Ala Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr val Met Ser 

225 230 235 240 

acg ccg aag gtg ttc ctg gag ttc tec cgc caa egg ggt etc gcg ccg 768 

Thr Pro Lys val Phe Leu Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro 

245 250 255 



816 



ggt gag ggt gec gga ctg ttg ttg ctg gag egg ttg teg gat gec egg 864 
Gly Glu Gly Ala Gly Leu Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg 
275 280 285 

egg aat ggg cat gag gtg ctg gcg gtt gtt cgt ggt agt gcg gtg aat 912 
Arg Asn Gly His Glu Val Leu Ala val Val Arg Gly Ser Ala Val Asn 
290 295 300 

cag gac ggt gcg teg aat ggt ttg acc gcg ccg aat ggt teg teg cag 960 
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Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Ser Ser Gin 
305 ' 310 315 320 

cag egg gtg att acc cag gcg ttg gcg agt gcg ggg ttg teg gtg tec 1008 

Gin Arg val lie Thr Gin Ala Leu Ala Ser Ala Gly Leu Ser val Ser 

325 330 335 

gat gtq gat get gtq gag gcg cat gqg acg ggc acg egg ctt ggt gat 1056 

Asp val Asp Ala val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp 
340 345 350 

ccg ate gag gcg cag gcg ctg ate gee acc tac gqc cgt gat cgt gat 1104 

Pro lie Glu Ala Gin Ala Leu lie Ala Thr Tyr Gly Arg Asp Arg Asp 
355 360 365 

cct gqc egg ccg ttg tgg ttg gqg teg gtc aag teg aac ate ggt cat 1152 

Pro Gly Arg Pro Leu Trp Leu Gly Ser val Lys Ser Asn lie Gly His 
370 375 380 

acg caa gcg gcg gcg ggt gtg get ggt gtg ate aag atg gtg atg gcg 

Thr Gin Ala Ala Ala Gly val Ala Gly val lie Lys Met val Met Ala 
385 390 395 400 



gtt ccg gtg gtg ccg tgg atg gtg teg ggc aaa aca ccc gaa gcg eta 
val Pro val val Pro Trp Met val ser Gly Lys Thr Pro Glu Ala Leu 
485 490 495 



tgc ggt gtg aaa gcg ctg tct gee ggt cat gag get tct ggg ttg gtg 

Cys Gly val Lys Ala Leu Ser Ala Gly His Glu Ala Ser Gly Leu Val 

545 550 555 560 
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atg egg cac ggg cag ctg cca cgc acg ttg cac gtg gaa teg ccg teg 1248 
Met Arg His Gly Gin Leu Pro Arg Thr Leu His val Glu Ser Pro Ser 
405 410 415 

ccg gag gtg gat tgg teg gcg ggg acg gtt caa etc ctt acg gag aac 1296 
Pro Glu Val Asp Trp Ser Ala Gly Thr Val Gin Leu Leu Thr Glu Asn 
420 425 430 

acg ccc tgg ccc agg agt ggt cgt gtt cgt egg gtg ggg gtg teg teg 1344 
Thr Pro Trp Pro Arg Ser Gly Arg val Arg Arg val Gly val Ser Ser 
435 ' 440 " 445 

ttc ggg ate agt ggt act aac gcg cac gtc ate etc gaa cag ccc ccg 1392 
Phe Gly lie Ser Gly Thr Asn Ala His Val lie Leu Glu Gin Pro Pro 
450 455 460 

gga gtg ccg agt cag tct gcg ggg ccg ggt teg ggt tct gtc gtg gat 1440 
Gly val Pro Ser Gin Ser Ala Gly Pro Gly Ser Gly Ser Val Val Asp 
465 470 475 480 



1488 



tec gcg cag gca acg gcg ttg atg acc tat ctg gac gag cga cct gat 1536 
Ser Ala Gin Ala Thr Ala Leu Met Thr Tyr Leu Asp Glu Arg Pro Asp 
500 505 510 

gtc tec teg ctg gat gtt ggg tac teg ctg gcg ttg aca egg teg gcg 1584 
val Ser Ser Leu Asp Val Gly Tyr Ser Leu Ala Leu Thr Arg ser Ala 
515 520 525 

ctg gat gag cga gcg gtg gtg ctg ggg teg gac cgt gaa acg ttg ttg 1632 
Leu Asp Glu Arg Ala Val val Leu Gly Ser Asp Arg Glu Thr Leu Leu 
530 535 540 



1680 
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acc gqa tct gtq gqg get gqg gqc cgc ate gqg ttt gtq ttt tec gqt 1728 

Thr Gly Ser val Gly Ala Gly Gly Arg lie Gly Phe val Phe Ser Gly 
565 570 575 



cag gqt gqt cag tgg ctg gqg atg gqc egg gqg ctt tac egg get ttt 
Gin Gly Gly Gin Trp Leu Gly Met Gly Arg Gly Leu Tyr Arg Ala Phe 
580 585 ~ 590 



cat ctg gqc cag gaa ate gqg gtt egg gag gtq gtq tec gqt teg gat 
His Leu Gly Gin Glu lie Gly Val Arg Glu val Val Ser Gly Ser Asp 
610 615 620 



gat tac egg cgt tgt gaa gtq ccg ate gtq teg acc ttg acc gqa gac 
Asp Tyr Arg Arg cys Glu val Pro lie val ser Thr Leu Thr Gly Asp 
785 790 795 800 



1776 



ccg gtq ttc get get gee ttt gac gaa get tgt gee gag ctg gat gca 1824 
Pro Val Phe Ala Ala Ala Phe Asp Glu Ala cys Ala Glu Leu Asp Ala 
595 600 605 



1872 



gcg cag ttg ctg gat egg acg ttg tgg gcg cag teg gqt ttg ttc gcg 1920 
Ala Gin Leu Leu Asp Arg Thr Leu Trp Ala Gin Ser Gly Leu Phe Ala 
625 630 635 640 

ttg cag gtq gqc ttg ctg aag ttg ctg gat teg tgg gqg gtt egg ccg 1968 
Leu Gin val Gly Leu Leu Lys Leu Leu Asp Ser Trp Gly Val Arg Pro 
645 650 655 

agt gtq gtq ttg gqg cat teg gtq gqc gag ttg gcg gcg gcg ttc gcg 
Ser val val Leu Gly His Ser val Gly Glu Leu Ala Ala Ala Phe Ala 
660 665 670 

gcg gqt gtq gtq teg ttg teg gqt gcg get egg ttg gtq gcg gqt cgt 
Ala Gly val Val Ser Leu Ser Gly Ala Ala Arg Leu val Ala Gly Arg 
675 680 685 

gee egg ttg atg cag gcg ttg ccg tct gqc gqt gqg atg ctg gcg gtq 
Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Gly Met Leu Ala val 
690 695 700 

cct get gqt gag gag ctg ttg tgg teg ttg ttg gee gat cag gqt gat 
Pro Ala Gly Glu Glu Leu Leu Trp Ser Leu Leu Ala Asp Gin Gly Asp 
705 710 715 720 

cgt gtq gqg ate gee gcg gtc aac get gcg gqg teg gtq gtq etc tct 
Arg val Gly lie Ala Ala Val Asn Ala Ala Gly Ser Val val Leu Ser 
725 730 735 



2016 



2064 



2112 



2160 



2208 



gqt gat egg gat gtq etc gat gac ctt gee gqt egg ctg gac gqg caa 2256 

Gly Asp Arg Asp val Leu Asp Asp Leu Ala Gly Arg Leu Asp Gly Gin 

740 745 ' ~ 750 

gqg ate egg teg agg tgg ttg egg gtq teg cat gcg ttt cat teg tat 2304 

Gly lie Arg ser Arg Trp Leu Arg Val ser His Ala Phe His Ser Tyr 

755 760 765 

egg atg gat ccg atg ctg gcg gag ttc gee gaa ttg gca cga acc gtq 2352 

Arg Met Asp Pro Met Leu Ala Glu Phe Ala Glu Leu Ala Arg Thr Val 
770 775 780 



2400 



etc gat gac get gqc agg atg age gqg ccc gac tac tgg gtq cgt cag 2448 
Leu Asp Asp Ala Gly Arg Met Ser Gly Pro Asp Tyr Trp Val Arg Gin 
805 810 815 
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gtq cga gag ccg gtc cgc ttc gcc gac gqt gtc cag gcg ctg gtc gag 2496 
val Arg Glu Pro Val Arg Phe Ala Asp Gly Val Gin Ala Leu val Glu 
820 825 830 

cac gat gtq gcc act gtt gtc gag etc gqt ccg gac ggg gcg ttg teg 2544 
His Asp Val Ala Thr val Val Glu Leu Gly Pro Asp Gly Ala Leu Ser 
835 840 845 

gcg ctg ate cag gaa tgt gtc gcc gca tec gat cac gcc gqg egg ctg 2592 
Ala Leu He Gin Glu cys Val Ala Ala Ser Asp His Ala Gly Arg Leu 
850 855 860 

age gcg gtc ccg gcg atg cgc agg aac cag gac gag gcg cag aag gtq 2640 
Ser Ala Val Pro Ala Met Arg Arg Asn Gin Asp Glu Ala Gin Lys Val 
865 870 875 880 

atg acg gcc ctg gca cac gtc cac gta cgt gqt gqt gcg gtq gac tgg 2688 
Met Thr Ala Leu Ala His Val His val Arg Gly Gly Ala Val Asp Trp 
885 890 895 

egg teg ttc ttc gcc gqt acg gga gcg aaa caa ate gag ctg ccc ace 2736 
Arg Ser Phe Phe Ala Gly Thr Gly Ala Lys Gin lie Glu Leu Pro Thr 
900 905 910 

tac gcc ttc caa cga cag egg tac tgg ctg gtq cca teg gat tec gqt 2784 
Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Val Pro Ser Asp Ser Gly 
915 920 925 

gat gtq aca gqt gcc gqt ctg gcc gqg gcg gag cat ccg ctg ttg gqt 2832 
Asp Val Thr Gly Ala Gly Leu Ala Gly Ala Glu His Pro Leu Leu Gly 
930 935 940 

get gtq gtq ccg gtc gcg gqt gqt gac gag gtq ttg ctg ace gqc agg 2880 
Ala val val Pro val Ala Gly Gly Asp Glu val Leu Leu Thr Gly Arg 
945 950 955 960 

att teg gtq egg acg cat ccg tgg ctg gcc gaa cac egg gtq ctg gqt 2928 
He Ser Val Arg Thr His Pro Trp Leu Ala Glu His Arg Val Leu Gly 
965 970 975 

gaa gtq ate gtt gcg gqc ace gcg ttg ctg gag ate gcc ttg cac gcg 2976 
Glu val lie val Ala Gly Thr Ala Leu Leu Glu lie Ala Leu His Ala 
980 985 990 

gqg gaa cgt ctt gqt tgt gaa egg gtq gaa gag etc ace ctg gaa gca 3024 
Gly Glu Arg Leu Gly Cys Glu Arg Val Glu Glu Leu Thr Leu Glu Ala 
995 1000 1005 

ccg ctg gtc ctg ccg gag cgc gqg gcg ate cag gtt cag ctg cga gtq 3072 
Pro Leu val Leu Pro Glu Arg Gly Ala lie Gin Val Gin Leu Arg Val 
1010 1015 1020 

gqc gcg ccc gag aat tec gga cgc agg ccg atg gcg ctg tat tea cgc 3120 
Gly Ala Pro Glu Asn Ser Gly Arg Arg Pro Met Ala Leu Tyr Ser Arg 
1025 1030 1035 1040 

ccc gaa gqg gcg gcg gag cat gac tgg acg egg cac gcc acg gqc egg 3168 
Pro Glu Gly Ala Ala Glu His Asp Trp Thr Arg His Ala Thr Gly Arg 
1045 1050 1055 

ttg gcg cca gqc cgc gqc gag gcg get gga gac ctg gcc gac tgg ccg 3216 
Leu Ala Pro Gly Arg Gly Glu Ala Ala Gly Asp Leu Ala Asp Trp Pro 
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1060 1065 1070 

get cct gqc gcg ctg ccg gtc gac etc gac gaa ttc tat egg gac etc 3264 
Ala Pro Gly Ala Leu Pro val Asp Leu Asp Glu Phe Tyr Arg Asp Leu 
1075 1080 1085 

gca gag ctt gqg ctg gag tac gqc ccg ate ttc caa gqg etc aag gcg 3312 
Ala Glu Leu Gly Leu Glu Tyr Gly Pro lie Phe Gin Gly Leu Lys Ala 
1090 1095 1100 

gec tgg egg caa gqg gac gag gtq tac gee gaa gee gcg ctg ccg gqa 3360 
Ala Trp Arg Gin Gly Asp Glu Val Tyr Ala Glu Ala Ala Leu Pro Gly 
1105 1110 1115 1120 

acg gaa gat tct gqt ttc gqg gtq cat ccg gca ctg ctg gac gcg get 3408 
Thr Glu Asp ser Gly Phe Gly Val His Pro Ala Leu Leu Asp Ala Ala 
1125 1130 1135 

ctg cac gca acg get gtc cga gac atg gat gac gca cgc ttg ccg ttc 3456 
Leu His Ala Thr Ala Val Arg Asp Met Asp Asp Ala Arg Leu Pro Phe 
1140 1145 1150 



cag tgg gaa gqt gtq tec ctg cac gee aag gee gcg ccg get ttg egg 
Gin Trp Glu Gly Val Ser Leu His Ala Lys Ala Ala Pro Ala Leu Arg 
1155 1160 1165 

gtc cgc gtq gtc ccg get gqt gac gat gee aag tec ctg ctg gtt tgt 
val Arg val Val Pro Ala Gly Asp Asp Ala Lys Ser Leu Leu Val Cys 
1170 1175 1180 

gat gqc ace gqt cga ccg gtq ate teg gtq gac cga etc gta ttg egg 
Asp Gly Thr Gly Arg Pro Val He Ser val Asp Arg Leu val Leu Arg 
1185 1190 1195 1200 



cag cca ccg tec tgc gtq ctt etc gqc acc tea gaa gtq tec get gac 

Gin Pro Pro Ser Cys val Leu Leu Gly Thr Ser Glu Val Ser Ala Asp 

1235 1240 1245 

ata cag gtq tat ccg gac etc egg teg ttg acg get gcg ttg gat gee 

lie Gin val Tyr Pro Asp Leu Arg Ser Leu Thr Ala Ala Leu Asp Ala 

1250 1255 1260 



3504 



3552 



3600 



teg get gcg gee egg egg acc gqt gcg cgc cga cag gec cat caa get 3648 

Ser Ala Ala Ala Arg Arg Thr Gly Ala Arg Arg Gin Ala His Gin Ala 

1205 ~ 1210 " 1215 

egg ttg tac egg ttg age tgg cca acg gtt caa ctg ccg aca tec get 3696 

Arg Leu Tyr Arg Leu Ser Trp Pro Thr Val Gin Leu Pro Thr Ser Ala 

1220 1225 1230 



3744 



3792 



gqt gec gaa cca ccc gqc gtc gtc ate gca ccc acg ccc ccc gqc gqt 3840 

Gly Ala Glu Pro Pro Gly val val lie Ala Pro Thr Pro Pro Gly Gly 
1265 1270 1275 1280 

gqa cga aca gcg gat gtc egg gag acg act egg cat gca etc gac ctg 3888 

Gly Arg Thr Ala Asp Val Arg Glu Thr Thr Arg His Ala Leu Asp Leu 
1285 ~ 1290 1295 

gta caa gqc tgg ctt tec gat cag cga etc aac gaa tec cga ttg etc 3936 

Val Gin Gly Trp Leu Ser Asp Gin Arg Leu Asn Glu Ser Arg Leu Leu 
1300 1305 1310 

ctg gtg aca cag gga gca gtg gec gtg gag ccg ggc gaa ccc gtg acc 3984 
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Leu val Thr Gin Gly Ala Val Ala Val Glu Pro Gly Glu Pro Val Thr 
1315 1320 1325 

gat ctg gcg cag gcc gcg etc tgg gga ctg ctg egg teg acg cag acc 4032 
Asp Leu Ala Gin Ala Ala Leu Trp Gly Leu Leu Arg Ser Thr Gin Thr 
1330 1335 1340 

gaa cac cct gat cgc ttc gtc etc gtc gat gtg cct gag ccc gcg caa 4080 
Glu His Pro Asp Arg Phe Val Leu Val Asp val Pro Glu Pro Ala Gin 
1345 1350 1355 1360 

etc etc ccc gcg ctg ccg ggg gtq ctg gcc tgc gqc gaa cct cag etc 4128 
Leu Leu Pro Ala Leu Pro Gly Val Leu Ala Cys Gly Glu Pro Gin Leu 
1365 1370 1375 

gcg ttg cga cgt gqc gqc get cat gcg ccc aga ctg get gga ctg gqc 
Ala Leu Arg Arg Gly Gly Ala His Ala Pro Arg Leu Ala Gly Leu Gly 
1380 1385 1390 

age gat gac gtc ctg ccc gtq ccg gac gqc acc gqg tgg cga ttg gag 
Ser Asp Asp val Leu Pro Val Pro Asp Gly Thr Gly Trp Arg Leu Glu 
1395 1400 1405 



gcg gcc gqg gtq aac ttc egg gat gcg etc ate gcg etc ggt atg tat 
Ala Ala Gly Val Asn Phe Arg Asp Ala Leu lie Ala Leu Gly Met Tyr 
1445 1450 1455 

ccc gqt gtq gca teg ctg gqc agt gag gqc gcc gqg gtc gtq gtq gag 
Pro Gly val Ala Ser Leu Gly Ser Glu Gly Ala Gly val val Val Glu 
1460 1465 1470 

acc gqc ccc gqc gtc acc gqc ctg gca ccc gqc gac cgc gtq atg gga 
Thr Gly Pro Gly Val Thr Gly Leu Ala Pro Gly Asp Arg Val Met Gly 
1475 1480 1485 



ttg aga cca ggg gag teg ttg ctg gtt cat teg gcc gcc ggt ggg gtg 
Leu Arg Pro Gly Glu Ser Leu Leu val His Ser Ala Ala Gly Gly Val 
1540 1545 1550 

ggg atg gcc gcg ate caa etc gcc agg cac etc ggt gca gag gtg tac 
Gly Met Ala Ala lie Gin Leu Ala Arg His Leu Gly Ala Glu val Tyr 
1555 1560 ~ 1565 
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4224 



gcc acg cgc ccg gga age ctg gat ggg ttg gca ttg gtg gac gaa ccg 4272 
Ala Thr Arg Pro Gly ser Leu Asp Gly Leu Ala Leu val Asp Glu Pro 
1410 1415 1420 

acg gcc acg gca ccg ctg ggt gac ggt gag gtc agg att gcg atg cgc 4320 
Thr Ala Thr Ala Pro Leu Gly Asp Gly Glu val Arg lie Ala Met Arg 
1425 1430 1435 1440 



4368 



4416 



4464 



atg ate ccg aag gcg ttc ggg ccg etc gcg gtc gcc gac cat cgc atg 4512 
Met lie Pro Lys Ala Phe Gly Pro Leu Ala val Ala Asp His Arg Met 
1490 1495 1500 

gtg acg agg att ccc get ggt tgg age ttc gcg egg gcc gca teg gtg 4560 
Val Thr Arg lie Pro Ala Gly Trp Ser Phe Ala Arg Ala Ala Ser val 
1505 1510 1515 1520 

ccg ate gtc ttt etc acc gcc tac tac gcg ctg gtt gat etc gcc ggg 4608 
Pro lie val Phe Leu Thr Ala Tyr Tyr Ala Leu Val Asp Leu Ala Gly 
1525 1530 1535 



4656 



4704 
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gcc acc get age gag gac aag tgg caa gec gtq gag ctg age cga gaa 4752 
Ala Thr Ala Ser Glu Asp Lys Trp Gin Ala val Glu Leu Ser Arg Glu 
1570 1575 1580 

cac etc get teg teg egg acg tgc gat ttc gag cag cag ttc etc gqg 4800 
His Leu Ala Ser Ser Arg Thr Cys Asp Phe Glu Gin Gin Phe Leu Gly 
1585 1590 1595 1600 

gca acc gqc gqa cgc gqc gtc gac gtc gtq etc aac tec etc gcc gqg 4848 
Ala Thr Gly Gly Arg Gly Val Asp Val val Leu Asn Ser Leu Ala Gly 
1605 1610 1615 

gag ttc gcc gat gcg tct ctg cga atg ctg ccg cgc gqt gqc cgt ttc 4896 
Glu Phe Ala Asp Ala Ser Leu Arg Met Leu Pro Arg Gly Gly Arg Phe 
1620 1625 ~ 1630 

ctg gag ttg gqg aag acg gat gtt cgt gac ccc gtc gag gtc gcc gat 4944 
Leu Glu Leu Gly Lys Thr Asp val Arg Asp Pro val Glu val Ala Asp 
1635 1640 1645 

gcg cat ccg gqc gtq tct tac cag get ttc gat acc gta gag gca gqc 4992 
Ala His Pro Gly val Ser Tyr Gin Ala Phe Asp Thr Val Glu Ala Gly 
1650 1655 1660 

ccg cag cga ate gqc gag atg ctt cac gag ctg gtq gag ttg ttc gag 5040 
Pro Gin Arg lie Gly Glu Met Leu His Glu Leu val Glu Leu Phe Glu 
1665 1670 1675 1680 

gqa cgc gtq ctg gag ccc ctg cct gtc acg get tgg gac gtt egg cag 5088 
Gly Arg val Leu Glu Pro Leu Pro Val Thr Ala Trp Asp val Arg Gin 
1685 1690 1695 

gcg ccc gag gcg eta egg cac ctg age caa gcg egg cat gtq gqa aag 5136 
Ala Pro Glu Ala Leu Arg His Leu Ser Gin Ala Arg His Val Gly Lys 
1700 1705 " 1710 

ctg gtq etc acc atg cct ccg gtq tgg gac gcc gca gqc acg gtt ctg 5184 
Leu Val Leu Thr Met Pro Pro val Trp Asp Ala Ala Gly Thr val Leu 
1715 1720 1725 

gtt acc gqc gqa acg gqa gca ctt gqc gca gag gtc gcc egg cac etc 5232 
Val Thr Gly Gly Thr Gly Ala Leu Gly Ala Glu Val Ala Arg His Leu 
1730 1735 1740 

gtq ate gag cgc gqg gtq cga aac ctg gtc etc gtc age agg cgc gqt 5280 
Val lie Glu Arg Gly Val Arg Asn Leu val Leu Val Ser Arg Arg Gly 
1745 ' 1750 1755 ~ 1760 

ccc gca gcc agt gqc get get gag etc gtq gcg caa ctg acg gcc tac 5328 
Pro Ala Ala Ser Gly Ala Ala Glu Leu Val Ala Gin Leu Thr Ala Tyr 
1765 1770 1775 

gqt gcc gag gtt tec ttg cag get tgc gat gtc gcc gat cgt gag acc 5376 
Gly Ala Glu val Ser Leu Gin Ala Cys Asp val Ala Asp Arg Glu Thr 
1780 1785 1790 

ttg gcg aag gtq ctt gcc age ate ccg gac gag cat ccg ttg acc gcc 5424 
Leu Ala Lys Val Leu Ala Ser lie Pro Asp Glu His Pro Leu Thr Ala 
1795 1800 1805 

gtq gtq cac gcg get gqt gtt etc gac gac gqa gtq tec gaa teg etc 5472 
val val His Ala Ala Gly val Leu Asp Asp Gly val ser Glu Ser Leu 
1810 1815 1820 
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acc gtq gag egg ctg gac cag gtt ctg cgc ccg aag gtc gat gqc gcg 5520 

Thr VaT Glu Arg Leu Asp Gin val Leu Arg Pro Lys Val Asp Gly Ala 
1825 1830 1835 1840 

egg aat ctg etc gag ctg ate gac ccg gac gtq gec etc gtq ttg ttc 5568 

Arg Asn Leu Leu Glu Leu lie Asp Pro Asp Val Ala Leu Val Leu Phe 
1845 1850 1855 

teg teg gtq teg ggt gtq etc gqc age gqt gqg cag gqt aac tac gcg 5616 

Ser ser val Ser Gly Val Leu Gly Ser Gly Gly Gin Gly Asn Tyr Ala 
1860 1865 ' 1870 

gcg gee aac tec ttc etc gac gca ttg gcg cag caa agg cag teg cgc 5664 

Ala Ala Asn Ser Phe Leu Asp Ala Leu Ala Gin Gin Arg Gin Ser Arg 
1875 1880 1885 

gqc eta ccg acg aga tea ttg gee tgg gqg ccc tgg gcg gaa cat gqc 5712 

Gly Leu Pro Thr Arg Ser Leu Ala Trp Gly Pro Trp Ala Glu His Gly 
1890 1895 1900 

atg gee age acc ttg cgc gaa gee gag cag gat cga ttg gcg cga tct 5760 

Met Ala Ser Thr Leu Arg Glu Ala Glu Gin Asp Arg Leu Ala Arg Ser 
1905 1910 1915 " 1920 

gqg ttg ctg ccg ate teg acc gag gag gqg ttg tec cag ttc gac gee 5808 

Gly Leu Leu Pro lie Ser Thr Glu Glu Gly Leu Ser Gin Phe Asp Ala 
1925 1930 1935 

gcg tgc gqc gqc gcg cat acc gtq gtq gcg ccg gtt cga ttc age cgc 5856 

Ala Cys Gly Gly Ala His Thr Val Val Ala Pro Val Arg Phe Ser Arg 
1940 1945 1950 

ttg tec gac gqg aac gcg ate aag ttc tec gtc ctg caa gqt ttg gtc 5904 

Leu Ser Asp Gly Asn Ala lie Lys Phe Ser val Leu Gin Gly Leu Val 
1955 1960 1965 

gqg ccg cat cgc gtc aac aaa gcg gcg act gcg gat gat gec gag age 5952 

Gly Pro His Arg val Asn Lys Ala Ala Thr Ala Asp Asp Ala Glu Ser 
1970 1975 1980 

etc egg aaa egg ttg gqa cgc ttg ccg gat gca gaa caa cat egg att 6000 

Leu Arg Lys Arg Leu Gly Arg Leu Pro Asp Ala Glu Gin His Arg lie 
1985 1990 ' 1995 2000 

ctg ctg gac etc gtc cgc atg cat gtq gcg gca gtq etc gqa ttc gec 6048 

Leu Leu Asp Leu Val Arg Met His val Ala Ala val Leu Gly Phe Ala 
2005 2010 2015 

gqt tct cag gag ate acc gcg gac gqc acg ttc aag gtq ctg gqc ttc 6096 

Gly Ser Gin Glu lie Thr Ala Asp Gly Thr Phe Lys val Leu Gly Phe 
2020 2025 2030 

gac teg ttg acc gtq gtc gag ttg cgc aac egg ate aac gqg gcg acg 6144 

Asp ser Leu Thr Val val Glu Leu Arg Asn Arg lie Asn Gly Ala Thr 
2035 2040 2045 

gqg ctg cga ctg ccc gec acc ctg gtq ttc aac tac ccg acg ccg gat 6192 

Gly Leu Arg Leu Pro Ala Thr Leu Val Phe Asn Tyr Pro Thr Pro Asp 
2050 " 2055 2060 

gcg etc gee gcg cac etc gtc acc gcg ctg tec gca gac cgc ctg gec 6240 

Ala Leu Ala Ala His Leu Val Thr Ala Leu Ser Ala Asp Arg Leu Ala 
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2065 2070 2075 2080 

gqg aca ttc gag gaa etc gac agg tgg gcg gcg aac ctg ccc acg ctg 6288 
Gly Thr Phe Glu Glu Leu Asp Arg Trp Ala Ala Asn Leu Pro Thr Leu 
2085 2090 2095 

gec agg gat gag gec acg egg gcg cag ate acc acc egg eta cag gcg 6336 
Ala Arg Asp Glu Ala Thr Arg Ala Gin lie Thr Thr Arg Leu Gin Ala 
2100 2105 2110 

ate ttg cag age ctg gcg gac gtq tec gqc gqa acc gqc gqc gqc tec 6384 
lie Leu Gin Ser Leu Ala Asp vat Ser Gly Gly Thr Gly Gly Gly Ser 
2115 2120 2125 

gtq ccg gac egg etc aga teg gec acg gac gac gag ctt ttc caa etc 6432 
val Pro Asp Arg Leu Arg Ser Ala Thr Asp Asp Glu Leu Phe Gin Leu 
2130 ~ 2135 2140 

etc gac aac gat etc gaa ctt ccc tga 6459 
Leu Asp Asn Asp Leu Glu Leu Pro 
2145 2150 

<210> 44 
<211> 2152 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 44 

Met Thr val Thr Thr Ser Tyr Glu Glu val val Glu Ala Leu Arg Ala 
15 10 15 

Ser Leu Lys Glu Asn Glu Arg Leu Arg Arg Gly Arg Asp Arg Phe Ser 
20 25 ~ 30 

Ala Glu Lys Asp Asp Pro lie Ala lie val Ala Met Ser Cys Arg Tyr 
35 40 45 

Pro Gly Gin val Ser Ser Pro Glu Asp Leu Trp Gin Leu Ala Ala Gly 
50 55 60 

Gly val Asp Ala lie Ser Glu Val Pro Gly Asp Arg Gly Trp Asp Leu 
65 70 75 80 

Asp Gly val Phe Val Pro Asp Ser Asp Arg Pro Gly Thr Ser Tyr Ala 
85 90 95 

Cys Ala Gly Gly Phe Leu Gin Gly val Ser Glu Phe Asp Ala Gly Phe 
100 105 110 

Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
115 120 125 

Leu Leu Leu Glu Val Ala Trp Glu Val Phe Glu Arg Ala Gly Leu Glu 
130 135 140 

Gin Arg Ser Thr Arg Gly Ser Arg val Gly val Phe val Gly Thr Asn 
145 " 150 155 160 

Gly Gin Asp Tyr Ala Ser Trp Leu Arg Thr Pro Pro Pro Ala val Ala 
165 170 175 

Gly His val Leu Thr Gly Gly Ala Ala Ala Val Leu Ser Gly Arg val 
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180 185 190 

Ala Tyr ser Phe Gly Phe Glu Gly Pro Ala val Thr val Asp Thr Ala 
195 200 205 

Cys Ser ser Ser Leu val Ala Leu His Leu Ala Gly Gin Ala Leu Arg 
210 215 220 

Ala Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr val Met Ser 
225 230 235 240 

Thr Pro Lys val Phe Leu Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro 
245 250 " 255 

Asp Gly Arg cys Lys Ser Phe Ala Ala Gly Ala Asp Gly Thr Gly Trp 
260 265 270 

Gly Glu Gly Ala Gly Leu Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg 
275 280 285 

Arg Asn Gly His Glu val Leu Ala val val Arg Gly Ser Ala val Asn 
290 295 300 

Gin Asp Gly Ala ser Asn Gly Leu Thr Ala Pro Asn Gly ser ser Gin 
305 310 315 320 

Gin Arg val lie Thr Gin Ala Leu Ala Ser Ala Gly Leu Ser val Ser 
325 330 335 

Asp val Asp Ala val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp 
340 345 350 

Pro lie Glu Ala Gin Ala Leu lie Ala Thr Tyr Gly Arg Asp Arg Asp 
355 360 365 

Pro Gly Arg Pro Leu Trp Leu Gly Ser val Lys Ser Asn lie Gly His 
370 375 380 

Thr Gin Ala Ala Ala Gly Val Ala Gly Val lie Lys Met Val Met Ala 
385 390 395 400 

Met Arg His Gly Gin Leu Pro Arg Thr Leu His val Glu ser Pro Ser 
405 " 410 415 

Pro Glu val Asp Trp Ser Ala Gly Thr Val Gin Leu Leu Thr Glu Asn 
420 425 430 

Thr Pro Trp Pro Arg Ser Gly Arg val Arg Arg Val Gly val ser Ser 
435 440 445 

Phe Gly lie Ser Gly Thr Asn Ala His val lie Leu Glu Gin Pro Pro 
450 455 460 

Gly val Pro ser Gin ser Ala Gly Pro Gly Ser Gly Ser val val Asp 
465 470 475 480 

val Pro Val val Pro Trp Met Val Ser Gly Lys Thr Pro Glu Ala Leu 
485 490 495 

Ser Ala Gin Ala Thr Ala Leu Met Thr Tyr Leu Asp Glu Arg Pro Asp 
500 505 510 

val ser Ser Leu Asp val Gly Tyr Ser Leu Ala Leu Thr Arg ser Ala 
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515 520 525 

Leu Asp Glu Arg Ala val val Leu Gly ser Asp Arg Glu Thr Leu Leu 
530 ' 535 540 

Cys Gly Val Lys Ala Leu Ser Ala Gly His Glu Ala Ser Gly Leu val 
545 550 555 " 560 

Thr Gly ser val Gly Ala Gly Gly Arg lie Gly Phe val Phe Ser Gly 
565 570 575 

Gin Gly Gly Gin Trp Leu Gly Met Gly Arg Gly Leu Tyr Arg Ala Phe 
580 585 590 

Pro val Phe Ala Ala Ala Phe Asp Glu Ala cys Ala Glu Leu Asp Ala 
595 600 605 

His Leu Gly Gin Glu lie Gly Val Arg Glu val Val Ser Gly Ser Asp 
610 615 620 

Ala Gin Leu Leu Asp Arg Thr Leu Trp Ala Gin Ser Gly Leu Phe Ala 
625 630 635 640 

Leu Gin val Gly Leu Leu Lys Leu Leu Asp Ser Trp Gly Val Arg Pro 
645 650 655 

Ser val val Leu Gly His Ser val Gly Glu Leu Ala Ala Ala Phe Ala 
660 665 670 

Ala Gly val val Ser Leu Ser Gly Ala Ala Arg Leu Val Ala Gly Arg 
675 680 ~ 685 

Ala Arg Leu Met Gin Ala Leu Pro ser Gly Gly Gly Met Leu Ala val 
690 695 700 

Pro Ala Gly Glu Glu Leu Leu Trp Ser Leu Leu Ala Asp Gin Gly Asp 
705 710 715 720 

Arg val Gly lie Ala Ala Val Asn Ala Ala Gly Ser val val Leu Ser 
725 730 735 

Gly Asp Arg Asp val Leu Asp Asp Leu Ala Gly Arg Leu Asp Gly Gin 
740 745 ' 750 

Gly lie Arg ser Arg Trp Leu Arg val Ser His Ala Phe His Ser Tyr 
755 " 760 765 

Arg Met Asp Pro Met Leu Ala Glu Phe Ala Glu Leu Ala Arg Thr val 
770 775 780 

Asp Tyr Arg Arg cys Glu val Pro lie val Ser Thr Leu Thr Gly Asp 
785 790 795 800 

Leu Asp Asp Ala Gly Arg Met Ser Gly Pro Asp Tyr Trp val Arg Gin 
805 810 815 

Val Arg Glu Pro val Arg Phe Ala Asp Gly val Gin Ala Leu val Glu 
820 825 830 

His Asp Val Ala Thr Val val Glu Leu Gly Pro Asp Gly Ala Leu Ser 
835 840 845 

Ala Leu lie Gin Glu Cys val Ala Ala Ser Asp His Ala Gly Arg Leu 
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850 855 860 

Ser Ala Val Pro Ala Met Arg Arg Asn Gin Asp Glu Ala Gin Lys Val 
865 870 ~ ~ 875 880 

Met Thr Ala Leu Ala His val His Val Arg Gly Gly Ala val Asp Trp 
885 890 895 

Arg Ser Phe Phe Ala Gly Thr Gly Ala Lys Gin lie Glu Leu Pro Thr 
900 905 910 

Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Val Pro Ser Asp Ser Gly 
915 920 925 

Asp val Thr Gly Ala Gly Leu Ala Gly Ala Glu His Pro Leu Leu Gly 
930 935 940 

Ala Val val Pro Val Ala Gly Gly Asp Glu val Leu Leu Thr Gly Arg 
945 950 955 960 

lie Ser val Arg Thr His Pro Trp Leu Ala Glu His Arg Val Leu Gly 
965 970 ~ 975 

Glu val lie val Ala Gly Thr Ala Leu Leu Glu lie Ala Leu His Ala 
980 985 990 

Gly Glu Arg Leu Gly Cys Glu Arg val Glu Glu Leu Thr Leu Glu Ala 
995 1000 1005 

Pro Leu Val Leu Pro Glu Arg Gly Ala lie Gin Val Gin Leu Arg val 
1010 1015 1020 

Gly Ala Pro Glu Asn Ser Gly Arg Arg Pro Met Ala Leu Tyr Ser Arg 
025 1030 1035 1040 

Pro Glu Gly Ala Ala Glu His Asp Trp Thr Arg His Ala Thr Gly Arg 
1045 1050 ~ 1055 

Leu Ala Pro Gly Arg Gly Glu Ala Ala Gly Asp Leu Ala Asp Trp Pro 
1060 1065 1070 

Ala Pro Gly Ala Leu Pro Val Asp Leu Asp Glu Phe Tyr Arg Asp Leu 
1075 1080 1085 

Ala Glu Leu Gly Leu Glu Tyr Gly Pro lie Phe Gin Gly Leu Lys Ala 
1090 1095 1100 

Ala Trp Arg Gin Gly Asp Glu Val Tyr Ala Glu Ala Ala Leu Pro Gly 
105 " 1110 1115 1120 

Thr Glu Asp ser Gly Phe Gly val His Pro Ala Leu Leu Asp Ala Ala 
1125 1130 1135 

Leu His Ala Thr Ala Val Arg Asp Met Asp Asp Ala Arg Leu Pro Phe 
1140 1145 1150 

Gin Trp Glu Gly Val Ser Leu His Ala Lys Ala Ala Pro Ala Leu Arg 
1155 1160 1165 

val Arg val val Pro Ala Gly Asp Asp Ala Lys ser Leu Leu Val Cys 
1170 1175 1180 

Asp Gly Thr Gly Arg Pro Val lie Ser Val Asp Arg Leu Val Leu Arg 
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185 1190 1195 1200 

Ser Ala Ala Ala Arg Arg Thr Gly Ala Arg Arg Gin Ala His Gin Ala 
1205 1210 1215 

Arg Leu Tyr Arg Leu Ser Trp Pro Thr val Gin Leu Pro Thr Ser Ala 
1220 1225 1230 

Gin Pro Pro Ser Cys Val Leu Leu Gly Thr Ser Glu val Ser Ala Asp 
1235 1240 1245 

lie Gin val Tyr Pro Asp Leu Arg Ser Leu Thr Ala Ala Leu Asp Ala 
1250 1255 1260 

Gly Ala Glu Pro Pro Gly val Val lie Ala Pro Thr Pro Pro Gly Gly 
265 1270 1275 1280 

Gly Arg Thr Ala Asp Val Arg Glu Thr Thr Arg His Ala Leu Asp Leu 
1285 ~ 1290 1295 

val Gin Gly Trp Leu Ser Asp Gin Arg Leu Asn Glu Ser Arg Leu Leu 
1300 1305 1310 

Leu val Thr Gin Gly Ala Val Ala Val Glu Pro Gly Glu Pro Val Thr 
1315 1320 1325 

Asp Leu Ala Gin Ala Ala Leu Trp Gly Leu Leu Arg Ser Thr Gin Thr 
1330 1335 1340 

Glu His Pro Asp Arg Phe val Leu val Asp val Pro Glu Pro Ala Gin 
345 1350 1355 1360 

Leu Leu Pro Ala Leu Pro Gly Val Leu Ala Cys Gly Glu Pro Gin Leu 
1365 1370 1375 

Ala Leu Arg Arg Gly Gly Ala His Ala Pro Arg Leu Ala Gly Leu Gly 
1380 1385 1390 

Ser Asp Asp val Leu Pro Val Pro Asp Gly Thr Gly Trp Arg Leu Glu 
1395 1400 1405 

Ala Thr Arg Pro Gly ser Leu Asp Gly Leu Ala Leu Val Asp Glu Pro 
1410 1415 1420 

Thr Ala Thr Ala Pro Leu Gly Asp Gly Glu Val Arg lie Ala Met Arg 
425 1430 1435 1440 

Ala Ala Gly Val Asn Phe Arg Asp Ala Leu lie Ala Leu Gly Met Tyr 
1445 1450 1455 

Pro Gly val Ala ser Leu Gly Ser Glu Gly Ala Gly Val Val Val Glu 
1460 ' 1465 1470 

Thr Gly Pro Gly val Thr Gly Leu Ala Pro Gly Asp Arg val Met Gly 
1475 1480 1485 

Met lie Pro Lys Ala Phe Gly Pro Leu Ala Val Ala Asp His Arg Met 
1490 1495 1500 

val Thr Arg lie Pro Ala Gly Trp Ser Phe Ala Arg Ala Ala Ser val 
505 ~ 1510 1515 1520 

Pro lie Val Phe Leu Thr Ala Tyr Tyr Ala Leu val Asp Leu Ala Gly 
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1525 1530 1535 

Leu Arg Pro Gly Glu Ser Leu Leu val His Ser Ala Ala Gly Gly val 
1540 1545 1550 

Gly Met Ala Ala lie Gin Leu Ala Arg His Leu Gly Ala Glu val Tyr 
1555 1560 1565 

Ala Thr Ala ser Glu Asp Lys Trp Gin Ala val Glu Leu Ser Arg Glu 
1570 1575 1580 

His Leu Ala Ser ser Arg Thr Cys Asp Phe Glu Gin Gin Phe Leu Gly 
585 1590 1595 1600 

Ala Thr Gly Gly Arg Gly val Asp val val Leu Asn Ser Leu Ala Gly 
1605 1610 1615 

Glu Phe Ala Asp Ala Ser Leu Arg Met Leu Pro Arg Gly Gly Arg Phe 
1620 1625 1630 

Leu Glu Leu Gly Lys Thr Asp Val Arg Asp Pro val Glu Val Ala Asp 
1635 1640 1645 

Ala His Pro Gly val Ser Tyr Gin Ala Phe Asp Thr val Glu Ala Gly 
1650 1655 1660 

Pro Gin Arg lie Gly Glu Met Leu His Glu Leu val Glu Leu Phe Glu 
665 " 1670 1675 1680 

Gly Arg val Leu Glu Pro Leu Pro Val Thr Ala Trp Asp val Arg Gin 
1685 1690 1695 

Ala Pro Glu Ala Leu Arg His Leu Ser Gin Ala Arg His val Gly Lys 
1700 ~ 1705 1710 

Leu Val Leu Thr Met Pro Pro Val Trp Asp Ala Ala Gly Thr Val Leu 
1715 1720 1725 

val Thr Gly Gly Thr Gly Ala Leu Gly Ala Glu val Ala Arg His Leu 
1730 1735 1740 

val He Glu Arg Gly val Arg Asn Leu val Leu val Ser Arg Arg Gly 
745 1750 1755 * 1760 

Pro Ala Ala Ser Gly Ala Ala Glu Leu Val Ala Gin Leu Thr Ala Tyr 
1765 1770 1775 

Gly Ala Glu val Ser Leu Gin Ala Cys Asp Val Ala Asp Arg Glu Thr 
1780 1785 1790 

Leu Ala Lys val Leu Ala Ser lie Pro Asp Glu His Pro Leu Thr Ala 
1795 1800 1805 

val val His Ala Ala Gly val Leu Asp Asp Gly Val Ser Glu Ser Leu 
1810 1815 1820 

Thr val Glu Arg Leu Asp Gin val Leu Arg Pro Lys val Asp Gly Ala 
825 1830 1835 1840 

Arg Asn Leu Leu Glu Leu lie Asp Pro Asp val Ala Leu val Leu Phe 
1845 1850 1855 

ser ser val ser Gly val Leu Gly ser Gly Gly Gin Gly Asn Tyr Ala 
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1860 1865 1870 

Ala Ala Asn Ser Phe Leu Asp Ala Leu Ala Gin Gin Arg Gin Ser Arg 
1875 1880 1885 

Gly Leu Pro Thr Arg Ser Leu Ala Trp Gly Pro Trp Ala Glu His Gly 
1890 1895 1900 

Met Ala ser Thr Leu Arg Glu Ala Glu Gin Asp Arg Leu Ala Arg ser 
905 1910 1915 1920 

Gly Leu Leu Pro He Ser Thr Glu Glu Gly Leu Ser Gin Phe Asp Ala 
1925 1930 1935 

Ala Cys Gly Gly Ala His Thr val val Ala Pro val Arg Phe Ser Arg 
1940 1945 1950 

Leu Ser Asp Gly Asn Ala lie Lys Phe Ser Val Leu Gin Gly Leu Val 
1955 1960 1965 

Gly Pro His Arg val Asn Lys Ala Ala Thr Ala Asp Asp Ala Glu Ser 
1970 ~ 1975 1980 

Leu Arg Lys Arg Leu Gly Arg Leu Pro Asp Ala Glu Gin His Arg lie 
985 " 1990 1995 2000 

Leu Leu Asp Leu val Arg Met His val Ala Ala val Leu Gly Phe Ala 
2005 2010 2015 

Gly Ser Gin Glu lie Thr Ala Asp Gly Thr Phe Lys val Leu Gly Phe 
2020 2025 2030 

Asp ser Leu Thr Val val Glu Leu Arg Asn Arg lie Asn Gly Ala Thr 
2035 2040 ~ 2045 

Gly Leu Arg Leu Pro Ala Thr Leu Val Phe Asn Tyr Pro Thr Pro Asp 
2050 2055 2060 

Ala Leu Ala Ala His Leu val Thr Ala Leu Ser Ala Asp Arg Leu Ala 
065 2070 2075 ~ 2080 

Gly Thr Phe Glu Glu Leu Asp Arg Trp Ala Ala Asn Leu Pro Thr Leu 
2085 " 2090 2095 

Ala Arg Asp Glu Ala Thr Arg Ala Gin lie Thr Thr Arg Leu Gin Ala 
2100 2105 2110 

lie Leu Gin Ser Leu Ala Asp Val Ser Gly Gly Thr Gly Gly Gly Ser 
2115 2120 2125 

val Pro Asp Arg Leu Arg ser Ala Thr Asp Asp Glu Leu Phe Gin Leu 
2130 2135 2140 

Leu Asp Asn Asp Leu Glu Leu Pro 
145 2150 



<210> 45 
<211> 9513 
<212> DNA 

<213> Saccharopolyspora spinosa 
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<220> 

<221> CDS 

<222> (1) . . (9510) 

<223> ORF20; polyketide synthase 

<400> 45 

atg teg aat gaa gag aag etc egg gag tac ttg egg cgt gcg etc gtq 48 

Met Ser Asn Glu Glu Lys Leu Arg Glu Tyr Leu Arg Arg Ala Leu val 
1 5 10 ~ ~ 15 

gat ctg cac cag gcg cgc gag egg ctg cac gag gcg gag teg gqa gag 96 
Asp Leu His Gin Ala Arg Glu Arg Leu His Glu Ala Glu Ser Gly Glu 
20 25 30 

egg gaa ccc ate gcg ate gtq gcg atg gqc tgc egg tac ccg gqt gqg 144 
Arg Glu pro lie Ala lie Val Ala Met Gly cys Arg Tyr Pro Gly Gly 
35 40 45 

gtq cag gac ccg gaa gqg ctg tgg aaa ctg gtc gec tec gqt gqc gac 192 
val Gin Asp Pro Glu Gly Leu Trp Lys Leu Val Ala Ser Gly Gly Asp 
50 55 60 

gec ate gqt gaa ttc ccc get gat cgt gqt tgg cac etc gac gag etc 240 
Ala lie Gly Glu Phe Pro Ala Asp Arg Gly Trp His Leu Asp Glu Leu 
65 70 ~ 75 80 

tac gat ccc gac ccg gat cag ccc gqa acc tgc tac acc egg cac gqc 288 
Tyr Asp Pro Asp Pro Asp Gin Pro Gly Thr cys Tyr Thr Arg His Gly 
85 90 95 

gqc ttc etc cac gac gec gqc gag ttc gac gcg gqa ttc ttc gac ate 336 
Gly Phe Leu His Asp Ala Gly Glu Phe Asp Ala Gly Phe Phe Asp lie 
100 ' 105 110 

age ccc cgt gag gcg etc gcg atg gac ccg cag cag egg ctg ctg ctg 384 
Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 

gaa ate tec tgg gag acc gtc gaa tec get gqg atg gac ccg agg tec 432 
Glu lie Ser Trp Glu Thr val Glu Ser Ala Gly Met Asp Pro Arg Ser 
130 135 140 

ttg egg gqg age cgc acc gqg gtq ttc gcg gqa ttg atg tac gag gqc 480 
Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu Met Tyr Glu Gly 
145 150 * 155 160 

tat gac acc gqc gec cac egg gca gqa gaa gqt gtc gaa gqc tat etc 528 
Tyr Asp Thr Gly Ala His Arg Ala Gly Glu Gly Val Glu Gly Tyr Leu 
165 170 175 



gqa acc gqc aat gcg gqa age gtc gee tct gqt egg gtt gcg tat gcg 
Gly Thr Gly Asn Ala Gly Ser Val Ala Ser Gly Arg Val Ala Tyr Ala 
180 185 " 190 



576 



ttc gqg ttc gag gqc cca gcg gtq acg gta gac acg gcg tgc teg teg 624 

Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser 

195 200 205 

teg ttg gtq gcg ctg cat ttg gcg tgt cag teg ttg egg cag gqc gag 672 

Ser Leu val Ala Leu His Leu Ala Cys Gin ser Leu Arg Gin Gly Glu 
210 215 220 

tgt gat ctg gcg ctg gee ggt gga gtg acg gtg atg teg acg ccg gag 720 
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Cys Asp Leu Ala Leu Ala Gly Gly Val Thr val Met Ser Thr Pro Glu 
225 230 235 240 

agg ttc gtq gag ttc tec cgt cag cgt gqt etc gca ccg gat gqg egg 768 

Arg Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg 
245 ~ 250 255 



tgt aag teg ttc gcg gcg get gcg gat gqa ace gqt tgg gqt gag gqt 
cys Lys ser Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly 
260 265 270 



get gtq gag gcg cat gqa acg gqt acg egg ctt gqt gat ccg ate gag 
Ala val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu 
340 345 " 350 



816 



gee gqt ttg gtq ttg ctg gag egg ctg tea gac gee agg egg aac gqg 864 

Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly 
275 280 285 

cat egg gta ctg gcg gtt gtt cgt gqt age gcg gtq aat cag gac gqt 912 

His Arg val Leu Ala val Val Arg Gly Ser Ala val Asn Gin Asp Gly 
290 295 " 300 

gcg teg aac gqa ttg acg gec ccg aac gqg ctg gec cag gag egg gtc 

Ala ser Asn Gly Leu Thr Ala Pro Asn Gly Leu Ala Gin Glu Arg Val 
305 310 315 320 



960 



att cag cag gtq etc acg agt gcg gqg ctg teg gcg tec gat gtq gac 1008 
lie Gin Gin val Leu Thr Ser Ala Gly Leu Ser Ala Ser Asp Val Asp 
325 330 335 



1056 



gcg cag get ctg ata gec gee tat gqa cag gat egg gac egg gac egg 1104 

Ala Gin Ala Leu lie Ala Ala Tyr Gly Gin Asp Arg Asp Arg Asp Arg 
355 360 365 

ccg ctg tgg ttg gqg teg gtc aag tec aac ate gqt cat acg cag gcg 1152 

Pro Leu Trp Leu Gly Ser val Lys Ser Asn lie Gly His Thr Gin Ala 
370 375 380 

get gcg gqc gtc get gqt gtq ate aag atg gtc atg gcg atg egg cac 

Ala Ala Gly Val Ala Gly Val lie Lys Met Val Met Ala Met Arg His 

385 390 395 400 

gqg gag ctg ccg cgc acg ttg cac gtq gac gag ccg aat teg cac gtq 

Gly Glu Leu Pro Arg Thr Leu His Val Asp Glu Pro Asn Ser His val 

405 410 415 



1200 



1248 



gac tgg teg get gqt gcg gtc cga etc ctg acc gag aac ate cgc tgg 1296 

Asp Trp Ser Ala Gly Ala val Arg Leu Leu Thr Glu Asn lie Arg Trp 
420 425 430 

cca gqg acg gqt acg cgc cgc get gqa gtq teg teg ttc gqg gta age 1344 

Pro Gly Thr Gly Thr Arg Arg Ala Gly val Ser Ser Phe Gly Val Ser 
435 440 445 

gqt acc aac gca cac gtc ate etc gaa cac gac ccg etc gee gtq acc 

Gly Thr Asn Ala His val lie Leu Glu His Asp Pro Leu Ala Val Thr 

450 455 460 



1392 



gag aac gag gaa gca gcg cag tec cca gca cct gqg ate gtq ccc tgg 1440 

Glu Asn Glu Glu Ala Ala Gin ser Pro Ala Pro Gly lie Val Pro Trp 
465 470 475 480 
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gcg ttg tec gqg egg teg teg acg gcg ctg egg gee cag gee gaa egg 1488 

Ala Leu Ser Gly Arg Ser Ser Thr Ala Leu Arg Ala Gin Ala Glu Arg 

485 490 ~ 495 

ctg cgc gag ctg tgc gag cag acc gat ccc gac ccc gtc gat gtc gqt 1536 

Leu Arg Glu Leu Cys Glu Gin Thr Asp Pro Asp Pro val Asp val Gly 

500 505 510 

ttc tea ctg gee gee acg cgc acg get tgg gag cac cga gcg gtq gtq 1584 

Phe Ser Leu Ala Ala Thr Arg Thr Ala Trp Glu His Arg Ala val val 
515 520 525 

ctt gqt egg gac age get acg ttg cgc tec gqg ctt gqc gtt gtt gee 1632 

Leu Gly Arg Asp Ser Ala Thr Leu Arg Ser Gly Leu Gly val Val Ala 
530 535 540 

age gqt gaa cca gcg gtc gat gtc gtt gag gqg age gtc ctg gac gqc 1680 

Ser Gly Glu Pro Ala val Asp val val Glu Gly Ser Val Leu Asp Gly 

545 550 555 560 



gag gtc gtc ttc gtc ttc ccc gqt cag gqc tgg cag tgg gee gqt atg 
Glu val val Phe Val Phe Pro Gly Gin Gly Trp Gin Trp Ala Gly Met 
565 570 575 



gtq ctg cgc gqa gcg gag aac tec cca ccg ctg gac egg gtq gac gtq 
val Leu Arg Gly Ala Glu Asn Ser Pro Pro Leu Asp Arg val Asp val 
610 ~ 615 620 

etc cag ccc gcg tec ttc gcg gtq atg gtq teg etc gee gag gtq tgg 
Leu Gin Pro Ala Ser Phe Ala Val Met Val Ser Leu Ala Glu val Trp 
625 630 635 640 



ttg ttc gcg gqa teg gqc gqc cgt ctg gaa gtt gcg gcg ate aac gqc 
Leu Phe Ala Gly Ser Gly Gly Arg Leu Glu Val Ala Ala lie Asn Gly 
705 710 715 720 



1728 



gca gtc gac ctg ctg gac get teg ccg acg ttc gcg cgc cac atg gac 1776 

Ala Val Asp Leu Leu Asp Ala Ser Pro Thr Phe Ala Arg His Met Asp 

580 585 590 

gag tgc gec acc gcg ctg egg agg tac gtq gac tgg teg ttg gtc gac 1824 

Glu cys Ala Thr Ala Leu Arg Arg Tyr val Asp Trp Ser Leu val Asp 
595 600 605 



1872 



1920 



cgt tec tac gqg gtq agg ccg gcg gec gtc gtc gqc cac agt caa gqc 1968 

Arg ser Tyr Gly val Arg Pro Ala Ala Val Val Gly His Ser Gin Gly 

645 ~ 650 655 

gaa ate gec gcg gee tgc gca gec gqg gtq ctg ccg ctg gag gat gcg 2016 

Glu He Ala Ala Ala Cys Ala Ala Gly val Leu Pro Leu Glu Asp Ala 

660 665 670 

gec agg ctt gtc gca ttg cgc age aga gcg ttg aag gqa ctt teg gqg 2064 

Ala Arg Leu val Ala Leu Arg Ser Arg Ala Leu Lys Gly Leu Ser Gly 

675 680 685 

egg gqt gqc atg gcg teg ctg gee tgc cct gcg gat gag gtc gcg gca 2112 

Arg Gly Gly Met Ala Ser Leu Ala Cys Pro Ala Asp Glu val Ala Ala 
690 695 700 



2160 



ccg cga teg gtc gtq gtq tec gqc gat ctg gaa gcg gtq gac gaa ctg 2208 
Pro Arg Ser val val val Ser Gly Asp Leu Glu Ala Val Asp Glu Leu 
725 730 735 
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ctg gca gag tgc get gaa aag gac atg cgt gca cgc cgt ate ccc gtc 2256 

Leu Ala Glu cys Ala Glu Lys Asp Met Arg Ala Arg Arg lie Pro val 

740 745 ~ 750 



gac tac gec teg cat tea gcg cac gtq gag gtq gtt egg age ccg gtq 

Asp Tyr Ala Ser His ser Ala His val Glu val val Arg Ser Pro VaT 

755 760 765 

ctg gcg gec gec gec gqg gtq cga cac egg gac gqc cag gtq ccg tgg 

Leu Ala Ala Ala Ala Gly val Arg His Arg Asp Gly Gin Val Pro Trp 

770 775 780 



gtq cag gqc ctg gtc gag egg gqa ttc gqc ctg ttc ate gaa atg agt 
Val Gin Gly Leu val Glu Arg Gly Phe Gly Leu Phe lie Glu Met ser 
820 825 830 



gag acc gec gtq gee gcg gta gqt ace ttg cga cgt gac teg gqc gqc 
Glu Thr Ala VaT Ala Ala val Gly Thr Leu Arg Arg Asp Ser Gly Gly 
850 855 860 



2304 



2352 



tgg teg acg gtq ate gqc gac tgg gtq gat ccg gec agg ctg gac gqc 2400 

Trp ser Thr val lie Gly Asp Trp VaT Asp Pro Ala Arg Leu Asp Gly 
785 790 795 800 

gag tat tgg tat egg aac etc egg cag ccg gtc egg ttc gaa cac gec 2448 

Glu Tyr Trp Tyr Arg Asn Leu Arg Gin Pro val Arg Phe Glu His Ala 
805 810 815 



2496 



gcg cat ccg gtq ctg acc acg gcg gtc gag gaa acc gqt gcg gag teg 2544 
Ala His Pro val Leu Thr Thr Ala val Glu Glu Thr Gly Ala Glu Ser 
835 840 845 



2592 



etc egg agg ttg ttg cat teg ctg gee gag gcg tac gtq cgc gqc gec 2640 
Leu Arg Arg Leu Leu His Ser Leu Ala Glu Ala Tyr Val Arg Gly Ala 
865 870 875 ~ 880 

acc gtq gac tgg gec gtq gcg ttc gqg gqc gcg gqc cga egg ctg gac 2688 
Thr Val Asp Trp Ala val Ala Phe Gly Gly Ala Gly Arg Arg Leu Asp 
885 890 895 

ctg ccg acc tac ccg ttc cag cgc cag egg tac tgg ctg gac aag gqa 2736 
Leu Pro Thr Tyr Pro Phe Gin Arg Gin Arg Tyr Trp Leu Asp Lys Gly 
900 905 ^ 910 

get gee tec gac gag get cgt gcg gtc teg gac ccg gcg gcg gqc tgg 2784 
Ala Ala Ser Asp Glu Ala Arg Ala val Ser Asp Pro Ala Ala Gly Trp 
915 920 925 

ttc tgg caa gec gtq gcg cgc caa gac ctg aaa age gtq tec gat gec 2832 
Phe Trp Gin Ala Val Ala Arg Gin Asp Leu Lys Ser val Ser Asp Ala 
930 935 940 

etc gat etc gac gec gac gca ccg ctg age gca aca ctt cca gec ctg 2880 
Leu Asp Leu Asp Ala Asp Ala Pro Leu Ser Ala Thr Leu Pro Ala Leu 
945 950 955 960 

tec gtc tgg cac cgt cag gaa cga gaa agg gtc ttg gca gac gqt tgg 2928 
Ser val Trp His Arg Gin Glu Arg Glu Arg Val Leu Ala Asp Gly Trp 
965 970 975 

egg tac cga gtc gac tgg gta egg gtq gee ccg cag ccg gtc egg aga 2976 
Arg Tyr Arg val Asp Trp val Arg val Ala Pro Gin Pro Val Arg Arg 

Page 153 



LeA33955-US 

980 985 990 

acg egg gaa acc tgg etc ctg gtc gtt ccc ccg gqc gqc ate gag gaa 3024 
Thr Arg Glu Thr Trp Leu Leu val val Pro Pro Gly Gly lie Glu Glu 
995 1000 1005 

gcg ctg gtc gaa egg ctg acg gat gcg ttg aac acg cga gqg ate age 3072 
Ala Leu val Glu Arg Leu Thr Asp Ala Leu Asn Thr Arg Gly He Ser 
1010 1015 1020 

acc ctg cgc etc gac gtq cca ccg gcg gcg acc agt gqc gaa etc gca 3120 
Thr Leu Arg Leu Asp Val Pro Pro Ala Ala Thr Ser Gly Glu Leu Ala 
1025 1030 1035 1040 

acc gaa etc cgc gee gca gee gac gqt gac ccg gtq aag gca ate ctg 3168 
Thr Glu Leu Arg Ala Ala Ala Asp Gly Asp Pro val Lys Ala lie Leu 
1045 1050 1055 

teg etc acc gcg ttg gac gag cga ccc cac ccc gaa tgc aag gac gtc 3216 
Ser Leu Thr Ala Leu Asp Glu Arg Pro His Pro Glu Cys Lys Asp val 
1060 1065 1070 

ccg age gqg att gee ttg ctg ctg aac ctg gtc aag gcg etc gqt gaa 3264 
Pro Ser Gly lie Ala Leu Leu Leu Asn Leu val Lys Ala Leu Gly Glu 
1075 1080 1085 

gee gac etc aga att cct ctg tgg acc ate acg cgt gqt gcg gtc aag 3312 
Ala Asp Leu Arg lie Pro Leu Trp Thr lie Thr Arg Gly Ala val Lys 
1090 1095 1100 

gca gqc ccc gca gat egg ctg ctg cgc ccg atg cag gcg caa gca tgg 3360 
Ala Gly Pro Ala Asp Arg Leu Leu Arg Pro Met Gin Ala Gin Ala Trp 
1105 1110 ~ 1115 1120 

gqt ctg gqg cga gta gec gca etc gaa cac ccc gag cgc tgg gqt gqg 3408 
Gly Leu Gly Arg val Ala Ala Leu Glu His Pro Glu Arg Trp Gly Gly 
1125 1130 1135 

ctg ate gac ctg ccg gat teg ctg gac gqc gac gtc etc acg agg ctg 3456 
Leu lie Asp Leu Pro Asp Ser Leu Asp Gly Asp val Leu Thr Arg Leu 
1140 1145 1150 

gqc gaa gcg etc acc aac gqc ttg gcg gaa gac caa ctg gcg att cgc 3504 
Gly Glu Ala Leu Thr Asn Gly Leu Ala Glu Asp Gin Leu Ala lie Arg 
1155 1160 1165 

cag teg gqc gtq ctg gee egg cga ctg gta ccc gec ccg gcg aat cag 3552 
Gin ser Gly Val Leu Ala Arg Arg Leu val Pro Ala Pro Ala Asn Gin 
1170 1175 1180 

ccc get gqa cgt aag tgg cgc ccc cga gqg age gcg ctg ate acg gqc 3600 
Pro Ala Gly Arg Lys Trp Arg Pro Arg Gly Ser Ala Leu lie Thr Gly 
1185 1190 ~ 1195 1200 

gqa etc gqc gcg gtq gqc gca cag gtq gcg agg tgg ttg gee gaa ate 3648 
Gly Leu Gly Ala val Gly Ala Gin val Ala Arg Trp Leu Ala Glu lie 
1205 1210 1215 

gqa gec gag cga ate gtq etc acc agt cga egg gqc aac caa gca gca 3696 
Gly Ala Glu Arg lie Val Leu Thr ser Arg Arg Gly Asn Gin Ala Ala 
1220 1225 ~ ~ 1230 

ggc gec gec gag ctg gaa gee gaa etc egg gee ctt gga gcg caa gtg 3744 
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Gly Ala Ala Glu Leu Glu Ala Glu Leu Arg Ala Leu Gly Ala Gin Val 
1235 1240 1245 

tec ate gtq get tgc gac gtq acc gat cgt gec gag atg tec gca eta 3792 
Ser lie val Ala Cys Asp Val Thr Asp Arg Ala Glu Met Ser Ala Leu 
1250 1255 1260 

ctg gec gag ttc gac gtc acc gcg gtq ttc cac gcg gec gqa gtc gqt 3840 
Leu Ala Glu Phe Asp Val Thr Ala val Phe His Ala Ala Gly val Gly 
1265 1270 1275 1280 

egg ctg ctg ccg ttg gcg gag acc gac cag aac gqc ctg gee gaa ata 3888 
Arg Leu Leu Pro Leu Ala Glu Thr Asp Gin Asn Gly Leu Ala Glu lie 
1285 1290 * 1295 

tgc gcg gcg aag gtc cgc gqc get cag gtq ctg gac gaa ctg tgc gac 3936 
Cys Ala Ala Lys val Arg Gly Ala Gin Val Leu Asp Glu Leu Cys Asp 
1300 1305 1310 

age acc gat etc gat gee ttc gtc ctg ttc tec teg gqt gee gqg gta 3984 
Ser Thr Asp Leu Asp Ala Phe val Leu Phe Ser Ser Gly Ala Gly val 
1315 1320 1325 

tgg ggc ggg ggc ggt cag gqc get tac gqc gcg gcg aac gca ttc ttg 4032 
Trp Gly Gly Gly Gly Gin Gly Ala Tyr Gly Ala Ala Asn Ala Phe Leu 
1330 1335 1340 

gac aca etc gee gaa caa cgc cga gca cgc gqt ctg ccg gca acc teg 4080 
Asp Thr Leu Ala Glu Gin Arg Arg Ala Arg Gly Leu Pro Ala Thr Ser 
1345 1350 " 1355 1360 

ate tec tgg gqc agt tgg gec gqc gqc gqc atg gee gac gqc gcg gcg 4128 
lie Ser Trp Gly Ser Trp Ala Gly Gly Gly Met Ala Asp Gly Ala Ala 
1365 1370 1375 

gqc gaa cac ctg egg cga cgc gqg ata cgt ccg atg ccg gcg gcg teg 4176 
Gly Glu His Leu Arg Arg Arg Gly lie Arg Pro Met Pro Ala Ala ser 
1380 1385 1390 

gec ate ctg get ctg cag gaa gta ctt gac cag gat gag acg tgc gtq 4224 
Ala lie Leu Ala Leu Gin Glu val Leu Asp Gin Asp Glu Thr Cys Val 
1395 1400 1405 

teg ate get gat gtq gac tgg gac cga ttc gtt ccc acg ttc gec gcg 4272 
Ser lie Ala Asp Val Asp Trp Asp Arg Phe val Pro Thr Phe Ala Ala 
1410 1415 1420 

act cgc gec acc egg ttg ttc gac gaa gtq ccg gcg gcg aga aag gcg 4320 
Thr Arg Ala Thr Arg Leu Phe Asp Glu Val Pro Ala Ala Arg Lys Ala 
1425 1430 1435 1440 

atg ccc gcg aat gqg ccg gca gaa cca gqc gqc teg ccg ttc gec cgc 4368 
Met Pro Ala Asn Gly Pro Ala Glu Pro Gly Gly Ser Pro Phe Ala Arg 
1445 1450 1455 

aat etc gcg gag ctg ccg gaa gee caa cga cgc cac gaa ctg gtq gat 4416 
Asn Leu Ala Glu Leu Pro Glu Ala Gin Arg Arg His Glu Leu vat Asp 
1460 1465 " 1470 

ctg gtq tgc gee cag gtq gca acc gtq etc gqg cac gqc agt cgc gag 4464 
Leu val Cys Ala Gin Val Ala Thr val Leu Gly His Gly Ser Arg Glu 
1475 1480 1485 
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gaa gtc cag ccc gag egg gcg ttc cgc gcg etc ggg ttc gac tec etc 4512 
Glu val Gin Pro Glu Arg Ala Phe Arg Ala Leu Gly Phe Asp Ser Leu 
1490 1495 1500 

atg gcg gtg gat ctg cgc aat cgt ttg acc acc gec acc gqg ttg cgc 4560 
Met Ala VaT Asp Leu Arg Asn Arg Leu Thr Thr Ala Thr Gly Leu Arg 
1505 1510 1515 1520 

ctg ccg acc aca acc gtc ttc gac tac ccg aat ccg gec gec ttg gec 4608 
Leu Pro Thr Thr Thr Val Phe Asp Tyr Pro Asn Pro Ala Ala Leu Ala 
1525 1530 1535 

get cac ctg etc gag gag ctg gtg ggt gat gtc gcg teg get gcg gtg 4656 
Ala His Leu Leu Glu Glu Leu VaT Gly Asp Val Ala Ser Ala Ala Val 
1540 1545 1550 

acc get gec age gcg ccc gcg agt gac gaa ccg ate gcg ate gtc gcg 4704 
Thr Ala Ala Ser Ala Pro Ala Ser Asp Glu Pro lie Ala lie val Ala 
1555 1560 1565 

atg age tgc egg ttt ccg ggt ggc gcg cac teg ccg gaa gac ctg tgg 4752 
Met ser cys Arg Phe Pro Gly Gly Ala His Ser Pro Glu Asp Leu Trp 
1570 1575 1580 

egg ctg gtc gee gee ggc acg gag gtg ate ggc gag ttc ccc tec gac 4800 
Arg Leu val Ala Ala Gly Thr Glu val lie Gly Glu Phe Pro Ser Asp 
1585 1590 1595 1600 

egg ggc tgg gat gcg gaa ggc ctt tac gat ccg gat get tec agg cct 4848 
Arg Gly Trp Asp Ala Glu Gly Leu Tyr Asp Pro Asp Ala Ser Arg Pro 
1605 1610 1615 

gga acg acg tat gcg egg atg gcg gga ttc etc tac gac gec ggt gag 4896 
Gly Thr Thr Tyr Ala Arg Met Ala Gly Phe Leu Tyr Asp Ala Gly Glu 
1620 1625 1630 

ttc gat gee gac ctg ttc ggc ate age cca cgt gag gcg ttg gcg atg 4944 
Phe Asp Ala Asp Leu Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met 
1635 1640 1645 

gat ccg cag cag egg ttg gtg etc gaa ate gec tgg gaa gec etc gaa 4992 
Asp Pro Gin Gin Arg Leu val Leu Glu lie Ala Trp Glu Ala Leu Glu 
1650 ^ 1655 1660 

egg gee gga ate gat ccg ttg tec ttg aag ggc agt ggg gtc ggc acg 5040 
Arg Ala Gly lie Asp Pro Leu Ser Leu Lys Gly Ser Gly Val Gly Thr 
1665 1670 1675 ' 1680 

tac ate ggc get gga age cgt ggg tac gcg acg gat gtg egg cag ttt 5088 
Tyr lie Gly Ala Gly Ser Arg Gly Tyr Ala Thr Asp Val Arg Gin Phe 
1685 1690 1695 

ccc gag gag gcg gag ggc tac ctg ctg acg ggt acc teg gec agt gtg 5136 
Pro Glu Glu Ala Glu Gly Tyr Leu Leu Thr Gly Thr Ser Ala Ser Val 
1700 1705 1710 

ctg teg ggt egg gtc gcg tat teg ttt ggt ttc gag ggt cct gcg gtg 5184 
Leu ser Gly Arg val Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val 
1715 ~ 1720 1725 

acg gtg gat acg get tgt teg teg teg ttg gtg gcg ttg cat ctg gcg 5232 
Thr val Asp Thr Ala cys Ser Ser Ser Leu val Ala Leu His Leu Ala 
1730 1735 1740 
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tgc cag teg ttg cgt teg gqc gag tgt gat ctg gcg ttg gec gqt gqt 5280 

Cys Gin Ser Leu Arg Ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly 
1745 1750 1755 1760 

gtq acc gtq atg teg acg ccg gag atg ttc gtq gag ttc tec cgt cag 5328 
val Thr val Met Ser Thr Pro Glu Met Phe val Glu Phe Ser Arg Gin 
1765 1770 1775 

cgc gqt ttg gcg ccg gat gqg egg tgc aag teg ttc gcg gag age gcg 5376 
Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala Glu Ser Ala 
1780 1785 1790 

gac gqc acc gqc tgg gqc gaa gqc gcg gqc ctg ttg ttg ctg gag egg 5424 
Asp Gly Thr Gly Trp Gly Glu Gly Ala Gly Leu Leu Leu Leu Glu Arg 
1795 1800 1805 

ttg teg gac gee cac egg aat gqg cat egg gtq ttg gcg gtq gtt cgt 5472 
Leu Ser Asp Ala His Arg Asn Gly His Arg val Leu Ala val val Arg 
1810 1815 ~ 1820 

gqg tea gcg gtq aat cag gac gqc gee teg aac gqa ctg gcg gcg ccg 5520 
Gly Ser Ala val Asn Gin Asp Gly Ala Ser Asn Gly Leu Ala Ala Pro 
1825 1830 1835 1840 

aac gqt ccg teg cag cag egg gtq ate aac cag gca etc gcg aat gcg 5568 
Asn Gly Pro Ser Gin Gin Arg Val lie Asn Gin Ala Leu Ala Asn Ala 
1845 ~ 1850 1855 

get ctt teg gcg tec gat gtq gat gcg gtq gag gca cat gqc acc gqg 5616 
Ala Leu Ser Ala Ser Asp val Asp Ala Val Glu Ala His Gly Thr Gly 
1860 1865 1870 

acc agg ctg gqt gat ccg ate gag gcg cag gca ttg ate gca acg tat 5664 
Thr Arg Leu Gly Asp Pro lie Glu Ala Gin Ala Leu lie Ala Thr Tyr 
1875 1880 1885 

gqg cag gee egg gag egg gat egg ccc ttg tgg ctg gqg teg gtc aag 5712 
Gly Gin Ala Arg Glu Arg Asp Arg Pro Leu Trp Leu Gly Ser val Lys 
1890 1895 1900 

teg aac ate gqt cat acg cag gee gcg gcg gqt gtt gee gqt gtq ate 5760 
ser Asn lie Gly His Thr Gin Ala Ala Ala Gly val Ala Gly Val lie 
1905 1910 1915 1920 

aag atg gtq atg gee atg egg cac gqg cag ctg ccc gee teg ctg cac 5808 
Lys Met val Met Ala Met Arg His Gly Gin Leu Pro Ala Ser Leu His 
1925 1930 1935 

gcg gat gag ccc acg teg gag gtc gat tgg teg teg gqg gcg gtc egg 5856 
Ala Asp Glu Pro Thr Ser Glu Val Asp Trp Ser Ser Gly Ala Val Arg 
1940 1945 1950 

etc etc gee gaa cag gta cct tgg ccg gag tct gac cgt gtt cgt egg 5904 
Leu Leu Ala Glu Gin val Pro Trp Pro Glu Ser Asp Arg Val Arg Arg 
1955 1960 1965 

gtg ggg gtt teg teg ttc gqg ate age gqc acc aac gca cat gtq ate 5952 
val Gly val Ser Ser Phe Gly lie Ser Gly Thr Asn Ala His Val lie 
1970 1975 1980 

etc gaa caa get acg aat gcg cca gat agt aca gcg gag acg gac aaa 6000 
Leu Glu Gin Ala Thr Asn Ala Pro Asp Ser Thr Ala Glu Thr Asp Lys 
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1985 1990 1995 2000 

aca gaa tec gqa tct act gtc gat att ccg gtc gtt ccc tgg ttg gtq 
Thr Glu ser Gly ser Thr Val Asp lie Pro val Val Pro Trp Leu val 
2005 2010 2015 



ttc gqg ttc gtq ttc teg gqg cag ggt gqt cag tgg ttg gqg atg gqc 
Phe Gly Phe val Phe Ser Gly Gin Gly Gly Gin Trp Leu Gly Met Gly 
2100 2105 2110 



gac gqc gcg atg ttg gcg gtq get get gqt gaa gac ctt gtt egg cca 
Asp Gly Ala Met Leu Ala Val Ala Ala Gly Glu Asp Leu Val Arg Pro 
2225 2230 2235 2240 



6048 



teg gqa aag acg acg gat tec ctg egg gqa caa gcc gaa cga gtc ttg 6096 
Ser Gly Lys Thr Thr Asp Ser Leu Arg Gly Gin Ala Glu Arg Val Leu 
2020 2025 2030 

tct cag gtc gag tec egg ccg gag cag cgt teg ctg gat gtt gcc tac 6144 
Ser Gin val Glu ser Arg Pro Glu Gin Arg Ser Leu Asp Val Ala Tyr 
2035 " 2040 2045 

teg ctt get tct gqc cga gcc gcg ctg gat gaa cgc get gtc gtq ctg 6192 
ser Leu Ala Ser Gly Arg Ala Ala Leu Asp Glu Arg Ala Val Val Leu 
2050 2055 2060 

gqt gcg gac cgc gqt gag ctg gtt get gqa ctg gcg gcg ttg gcc gcc 
Gly Ala Asp Arg Gly Glu Leu Val Ala Gly Leu Ala Ala Leu Ala Ala 
2065 2070 2075 2080 



6240 



gqt cag gag get tct gqg gtq ate age gqa act cgt get tct get egg 6288 
Gly Gin Glu Ala ser Gly val lie Ser Gly Thr Arg Ala ser Ala Arg 
2085 2090 2095 



6336 



aga gcg etc tac teg aag ttt ccg gtq ttc get get gcg ttt gat gag 6384 
Arg Ala Leu Tyr ser Lys Phe Pro Val Phe Ala Ala Ala Phe Asp Glu 
2115 2120 2125 

get tgc gcc gag ttg gag gca cat ctg gqg gaa gac cgc egg gtt egg 6432 
Ala Cys Ala Glu Leu Glu Ala His Leu Gly Glu Asp Arg Arg Val Arg 
2130 2135 2140 

gat gtq gtc ttc gqt tec gat gcg cag ctg ctg gat cag acg ctg tgg 6480 
Asp val val Phe Gly ser Asp Ala Gin Leu Leu Asp Gin Thr Leu Trp 
2145 2150 2155 2160 

gcg cag teg gqt ctg ttc gcg ctg caa gcc gqc etc ttg gqg ctg ctg 6528 
Ala Gin Ser Gly Leu Phe Ala Leu Gin Ala Gly Leu Leu Gly Leu Leu 
2165 2170 2175 

gqt teg tgg gqc gtt egg ccg gat gtq gtq atg gqg cat teg gtc gqg 6576 
Gly ser Trp Gly Val Arg Pro Asp Val Val Met Gly His Ser Val Gly 
2180 2185 2190 

gag ttg gcc gcc gcg ttt gcg get gqc gtq ttg teg ttg egg gat gcg 6624 
Glu Leu Ala Ala Ala Phe Ala Ala Gly Val Leu Ser Leu Arg Asp Ala 
2195 2200 2205 

get egg ttg gtq gcc gcg cgc gcc egg ttg atg caa gcc ctg ccc tct 6672 
Ala Arg Leu Val Ala Ala Arg Ala Arg Leu Met Gin Ala Leu Pro Ser 
2210 2215 2220 



6720 



ttg ctg gcc ggt egg gag gag tec gtg age gtc gcc gcg etc aat gcc 6768 
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Leu Leu Ala Gly Arg Glu Glu Ser Val Ser Val Ala Ala Leu Asn Ala 
2245 2250 2255 

ccc ggt teg gtg gtg ttg teg ggc gat egg gag gtg ctg gec age ate 6816 
Pro Gly Ser Val Val Leu Ser Gly Asp Arg Glu val Leu Ala Ser lie 
2260 2265 2270 

gtc ggc egg ctg acc gag etc cga gtc egg acg egg cgc ttg egg gtc 6864 
Val Gly Arg Leu Thr Glu Leu Arg Val Arg Thr Arg Arg Leu Arg Val 
2275 2280 2285 

tec cat get ttt cat teg cac egg atg gac ccg atg ttg ggc gag ttc 6912 
Ser His Ala Phe His Ser His Arg Met Asp Pro Met Leu Gly Glu Phe 
2290 2295 2300 

gee cag ate gee gag tct gcg gag ttc ggt aag cca acg aca ccg ctt 6960 
Ala Gin lie Ala Glu Ser Ala Glu Phe Gly Lys Pro Thr Thr Pro Leu 
2305 2310 2315 2320 

gtg teg acg ttg acg ggt gag etc gac aga gec gcg gaa atg age aca 7008 
val Ser Thr Leu Thr Gly Glu Leu Asp Arg Ala Ala Glu Met Ser Thr 
2325 2330 2335 

cca ggg tat tgg gtg cgc cag gcg cgt gaa ccc gtc cgt ttc gec gac 7056 
Pro Gly Tyr Trp Val Arg Gin Ala Arg Glu Pro val Arg Phe Ala Asp 
2340 2345 2350 

ggt gtc cag gec ctg gca gcg cag ggc ata ggc acg gtc gtc gag etc 7104 
Gly val Gin Ala Leu Ala Ala Gin Gly lie Gly Thr Val Val Glu Leu 
2355 2360 2365 

ggc ccg gac gga acg ctg gcg gca ctg gtt egg gag tgt gcg acc gag 7152 
Gly Pro Asp Gly Thr Leu Ala Ala Leu val Arg Glu Cys Ala Thr Glu 
2370 2375 2380 

tec gat egg gtt ggg egg att teg teg ate cca ctg atg cgc agg gag 7200 
Ser Asp Arg Val Gly Arg lie Ser Ser lie Pro Leu Met Arg Arg Glu 
2385 2390 2395 2400 

egg gac gag acc cgt teg gtg atg aca gee ctg gcg cat etc cac acc 7248 
Arg Asp Glu Thr Arg Ser Val Met Thr Ala Leu Ala His Leu His Thr 
2405 2410 2415 

cgt ggt ggt gag gtg gac tgg cag gcg ttt ttc gec ggt acc ggc get 7296 
Arg Gly Gly Glu Val Asp Trp Gin Ala Phe Phe Ala Gly Thr Gly Ala 
2420 2425 2430 

agg cag etc gag ttg cca acg tat gec ttc caa cga cag cac tac tgg 7344 
Arg Gin Leu Glu Leu Pro Thr Tyr Ala Phe Gin Arg Gin His Tyr Trp 
2435 2440 2445 

ate gag tec agt gcg egg cca gca cgc gac cgc gca gac ate ggc gag 7392 
He Glu Ser ser Ala Arg Pro Ala Arg Asp Arg Ala Asp lie Gly Glu 
2450 2455 2460 

Qtg gcg g aa ca g ttc tgg a cc gcg gtt gac caa ggc gat ctg gca acg 7440 
val Ala Glu Gin Phe Trp Thr Ala Val Asp Gin Gly Asp Leu Ala Thr 
2465 2470 2475 2480 

ttg gtc gec get ctg gat ctt ggg gcg gac gac gac aca tgc gca teg 7488 
Leu val Ala Ala Leu Asp Leu Gly Ala Asp Asp Asp Thr Cys Ala Ser 
2485 2490 2495 
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ttg age gat gta ttg ccg gcg ttg tec tec tgg cga age gqa etc cgc 7536 

Leu Ser Asp val Leu Pro Ala Leu Ser Ser Trp Arg Ser Gly Leu Arg 
2500 2505 2510 

aac cgt teg etc gtc gat tec tgc egg tac cga ate agt tgg cat tec 7584 

Asn Arg Ser Leu val Asp ser Cys Arg Tyr Arg lie Ser Trp His Ser 
2515 2520 2525 



tct egg gag gtq ccg gee ccg aag att tec gqt ace tgg ctg ttg gtc 7632 
Ser Arg Glu Val Pro Ala Pro Lys lie Ser Gly Thr Trp Leu Leu val 
2530 2535 2540 



gtq ccc gqt get gcg gat gac gqa ttg gtc acg get ttg acg agt tea 7680 

val Pro Gly Ala Ala Asp Asp Gly Leu val Thr Ala Leu Thr ser Ser 
2545 2550 2555 2560 

ctg gtc gqa gqc gqc gee gag gtc gtc egg ate gqc ctg tec gaa gag 7728 

Leu val Gly Gly Gly Ala Glu val Val Arg lie Gly Leu Ser Glu Glu 

2565 2570 2575 

gac ccg cac cgc gag gac gtc gca cag egg ctg gec aat gcg ctg acg 7776 

Asp Pro His Arg Glu Asp Val Ala Gin Arg Leu Ala Asn Ala Leu Thr 

2580 2585 2590 

gat gec gqt caa etc gqt gqc gtq ctt teg ctg ttg gqg etc gat gaa 7824 

Asp Ala Gly Gin Leu Gly Gly Val Leu Ser Leu Leu Gly Leu Asp Glu 
2595 2600 2605 

teg cct get ccg gqa ttc tec tgc ttg cca act gqt ttc gcg ctg act 7872 

Ser Pro Ala Pro Gly Phe Ser cys Leu Pro Thr Gly Phe Ala Leu Thr 
2610 * 2615 2620 

gtq cag ctt ctg egg gee ttg egg aag gec gac gtc gag gcg cct ttt 7920 

Val Gin Leu Leu Arg Ala Leu Arg Lys Ala Asp val Glu Ala Pro Phe 
2625 2630 2635 2640 

tgg gcg gtq acg cgc gqc gqc gtc gcg ttg gaa gat gta cgc gtq tct 7968 

Trp Ala val Thr Arg Gly Gly Val Ala Leu Glu Asp Val Arg Val Ser 

2645 2650 2655 



ccg gag cag gec ctg gtc tgg gqg ctg ctg cgt gtc gcg gqa ctg gag 
Pro Glu Gin Ala Leu val Trp Gly Leu Leu Arg val Ala Gly Leu Glu 
2660 2665 2670 



gat caa gtc gee att cgc cgt ggt gqt gtq ttc gtg egg 
Asp Gin val Ala lie Arg Arg Gly Gly Val Phe Val Arg 
2705 2710 2715 



cgc get gqt gcg teg gqt gec gqg teg gtq tgg cgt cct egg gqg acg 
Arg Ala Gly Ala Ser Gly Ala Gly Ser Val Trp Arg Pro Arg Gly Thr 
2725 2730 2735 



8016 



cac ccg gag ttc tgg gqt gqc ttg ate gac ctg cca teg gac tgg gac 8064 

His Pro Glu Phe Trp Gly Gly Leu lie Asp Leu Pro ser Asp Trp Asp 
2675 2680 2685 

gac cga ttg gqt gec egg ttg gcg gqt gtq ttg gcg gat gqt gqc gag 8112 

Asp Arg Leu Gly Ala Arg Leu Ala Gly Val Leu Ala Asp Gly Gly Glu 
2690 2695 2700 



egg ttg gaa 8160 
Arg Leu Glu 
2720 



8208 



gtq ttg gtq acg gqt gqt acg gqc gqt ttg gqg gcg cat gtt gee egg 8256 

val Leu val Thr Gly Gly Thr Gly Gly Leu Gly Ala His val Ala Arg 
2740 2745 2750 
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tgg ttg gcc gqt gcc gqg get gag cac gtq gtq ttg acc age cgt cga 8304 
Trp Leu Ala Gly Ala Gly Ala Glu His Val val Leu Thr Ser Arg Arg 
2755 2760 2765 



gqa gcg gac get ccg gqc get gqg gaa ttg egg gcg gag ctg gag gcg 
Gly Ala Asp Ala Pro Gly Ala Gly Glu Leu Arg Ala Glu Leu Glu Ala 
2770 2775 2780 

ctg gqt get egg gtq teg att gtq ccc tgc gac gtq get gat cgt gac 
Leu Gly Ala Arg val Ser lie Val Pro Cys Asp Val Ala Asp Arg Asp 
2785 2790 2795 2800 



gcg gtq gta cac gcc gcc gqg gtc gqc gag gcg gqc gac gta gtq gag 
Ala val val His Ala Ala Gly val Gly Glu Ala Gly Asp val val Glu 
2820 2825 2830 

atg gqt ttg gcg gat ttt gca gcg gtq ttg teg gcg aag gtq cgt gqt 
Met Gly Leu Ala Asp Phe Ala Ala val Leu ser Ala Lys val Arg Gly 
2835 2840 2845 



gtq atg ttc tec teg gtq teg gqg gtq tgg gqa gcc gqc gqa cag gqt 
Val Met Phe Ser Ser Val Ser Gly Val Trp Gly Ala Gly Gly Gin Gly 
2865 2870 2875 2880 



egg gcg agg gqa ttg gtc gqg acc gcg gtt gcg tgg gqa ccg tgg gcc 

Arg Ala Arg Gly Leu Val Gly Thr Ala Val Ala Trp Gly Pro Trp Ala 
2900 2905 2910 

gqt gac gqc atg gcc gcc gqc gaa acc gqc gca cag ctg cac egg atg 

Gly Asp Gly Met Ala Ala Gly Glu Thr Gly Ala Gin Leu His Arg Met 
2915 2920 2925 



gac acc ate gac gag gcc cga gcc gca ttg gaa acc acc gqc gaa caa 
Asp Thr lie Asp Glu Ala Arg Ala Ala Leu Glu Thr Thr Gly Glu Gin 
2980 2985 2990 

gcg gqc aca gqc aaa ccc gtt gag ctg acg caa cgc ctg gcc gqa ctg 
Ala Gly Thr Gly Lys Pro Val Glu Leu Thr Gin Arg Leu Ala Gly Leu 
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8352 



8400 



gca gtq get gqa gtq ttg gca gqg ate gqt gqg gag tgt ccg ctg act 8448 
Ala Val Ala Gly Val Leu Ala Gly lie Gly Gly Glu Cys Pro Leu Thr 
2805 2810 2815 



8496 



8544 



gcg gcg aat ctg gac gag ttg ctg gcc gac teg gag ctg gat gcg ttt 8592 
Ala Ala Asn Leu Asp Glu Leu Leu Ala Asp ser Glu Leu Asp Ala Phe 
2850 2855 2860 



8640 



gcg tat gcg get gcg aac gcc tac ttg gat gcg ttg gcc gag cag cgt 8688 
Ala Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Ala Leu Ala Glu Gin Arg 
2885 2890 2895 



8736 



8784 



gqc ctg gcg teg atg gaa ccg age gcg gcg ctg ctg gca ctt cag gqt 8832 

Gly Leu Ala Ser Met Glu Pro Ser Ala Ala Leu Leu Ala Leu Gin Gly 
2930 2935 2940 

gca ttg gac cgc gat gag acc tec etc gtc gtq gcc gat gtc gat tgg 8880 

Ala Leu Asp Arg Asp Glu Thr Ser Leu val Val Ala Asp Val Asp Trp 
2945 2950 2955 2960 

gca egg ttc gcc cca gcc ttc acc teg gca cgt cga cgc ccg ctg ctg 8928 

Ala Arg Phe Ala Pro Ala Phe Thr Ser Ala Arg Arg Arg Pro Leu Leu 
2965 2970 " 2975 



8976 



9024 
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2995 3000 3005 

teg egg aag gaa cgc gac gat gcg gta ttg gat ctg gtq egg gcg gag 9072 
Ser Arg Lys Glu Arg Asp Asp Ala Val Leu Asp Leu val Arg Ala Glu 
3010 3015 3020 

acg gcg get gtq ctg gga cgc gac gat gec acg gec ctg gcg cca teg 9120 
Thr Ala Ala Val Leu Gly Arg Asp Asp Ala Thr Ala Leu Ala Pro Ser 
3025 3030 3035 3040 

egg ccg ttc cag gaa etc gga ttc gac tec ttg atg gcg gtq gag ctg 9168 
Arg Pro Phe Gin Glu Leu Gly Phe Asp Ser Leu Met Ala Val Glu Leu 
3045 3050 3055 



cgc aac egg ctg aac acc gec ace gqg ate cag ctg ccc gec age acg 
Arg Asn Arg Leu Asn Thr Ala Thr Gly lie Gin Leu Pro Ala Ser Thr 
3060 3065 3070 



9216 



att ttc gac tac ccc aat gec gag teg ctg teg cgt cac etc tgc gec 9264 
lie Phe Asp Tyr Pro Asn Ala Glu Ser Leu Ser Arg His Leu Cys Ala 
3075 3080 3085 

gag ctt ttc cca acg gag act acc gtq gac teg gee ctt gec gag etc 9312 
Glu Leu Phe Pro Thr Glu Thr Thr Val Asp Ser Ala Leu Ala Glu Leu 
3090 3095 3100 

gat cga ate gag cag cag etc teg atg etc acc ggc gaa gcg egg gca 9360 
Asp Arg lie Glu Gin Gin Leu Ser Met Leu Thr Gly Glu Ala Arg Ala 
3105 3110 3115 3120 

egg gac cga ate gcg aca cga ctg cga gec etc cac gag aag tgg aac 9408 
Arg Asp Arg lie Ala Thr Arg Leu Arg Ala Leu His Glu Lys Trp Asn 
3125 3130 3135 

age gca get gaa gta ccg acc gga gec gat gtc ctg age acg etc gat 9456 
Ser Ala Ala Glu val Pro Thr Gly Ala Asp Val Leu Ser Thr Leu Asp 
3140 3145 3150 

teg gcg acg cac gac gag ata ttc gag ttc ate gac aac gag etc gac 9504 
Ser Ala Thr His Asp Glu lie Phe Glu Phe lie Asp Asn Glu Leu Asp 
3155 3160 3165 

ctg tec tga 9513 
Leu Ser 
3170 

<210> 46 
<211> 3170 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 46 

Met Ser Asn Glu Glu Lys Leu Arg Glu Tyr Leu Arg Arg Ala Leu Val 
1 5 10 ~ ^ 15 

Asp Leu His Gin Ala Arg Glu Arg Leu His Glu Ala Glu Ser Gly Glu 
20 25 30 * 

Arg Glu Pro lie Ala lie Val Ala Met Gly Cys Arg Tyr Pro Gly Gly 
35 40 45 

Val Gin Asp Pro Glu Gly Leu Trp Lys Leu Val Ala Ser Gly Gly Asp 
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50 55 60 

Ala lie Gly Glu Phe Pro Ala Asp Arg Gly Trp His Leu Asp Glu Leu 
65 70 75 80 

Tyr Asp Pro Asp Pro Asp Gin Pro Gly Thr Cys Tyr Thr Arg His Gly 
85 90 95 

Gly Phe Leu His Asp Ala Gly Glu Phe Asp Ala Gly Phe Phe Asp lie 
100 105 110 

Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 

Glu lie Ser Trp Glu Thr Val Glu Ser Ala Gly Met Asp Pro Arg Ser 
130 135 140 

Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu Met Tyr Glu Gly 
145 150 155 ' 160 

Tyr Asp Thr Gly Ala His Arg Ala Gly Glu Gly val Glu Gly Tyr Leu 
165 ~ 170 175 

Gly Thr Gly Asn Ala Gly ser Val Ala Ser Gly Arg val Ala Tyr Ala 
180 185 190 

Phe Gly Phe Glu Gly Pro Ala val Thr val Asp Thr Ala Cys Ser Ser 
195 200 205 

Ser Leu Val Ala Leu His Leu Ala Cys Gin Ser Leu Arg Gin Gly Glu 
210 215 220 

Cys Asp Leu Ala Leu Ala Gly Gly val Thr Val Met Ser Thr Pro Glu 
225 230 235 240 

Arg Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg 
245 250 255 

Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly 
260 265 270 

Ala Gly Leu val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly 
275 280 285 

His Arg val Leu Ala val val Arg Gly Ser Ala val Asn Gin Asp Gly 
290 295 300 

Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Leu Ala Gin Glu Arg val 
305 310 315 320 

lie Gin Gin Val Leu Thr Ser Ala Gly Leu Ser Ala Ser Asp Val Asp 
325 330 335 

Ala val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu 
340 345 ' 350 

Ala Gin Ala Leu He Ala Ala Tyr Gly Gin Asp Arg Asp Arg Asp Arg 
355 360 365 

Pro Leu Trp Leu Gly Ser Val Lys Ser Asn lie Gly His Thr Gin Ala 
370 375 380 

Ala Ala Gly Val Ala Gly val lie Lys Met Val Met Ala Met Arg His 
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385 390 395 400 

Gly Glu Leu Pro Arg Thr Leu His val Asp Glu Pro Asn ser His val 
405 410 415 

Asp Trp ser Ala Gly Ala val Arg Leu Leu Thr Glu Asn lie Arg Trp 
420 425 430 

Pro Gly Thr Gly Thr Arg Arg Ala Gly val ser Ser Phe Gly val ser 
435 440 445 

Gly Thr Asn Ala His val lie Leu Glu His Asp Pro Leu Ala val Thr 
450 455 460 

Glu Asn Glu Glu Ala Ala Gin ser Pro Ala Pro Gly lie Val Pro Trp 
465 470 475 480 

Ala Leu ser Gly Arg Ser Ser Thr Ala Leu Arg Ala Gin Ala Glu Arg 
485 490 " 495 

Leu Arg Glu Leu Cys Glu Gin Thr Asp Pro Asp Pro val Asp Val Gly 
500 505 510 

Phe Ser Leu Ala Ala Thr Arg Thr Ala Trp Glu His Arg Ala Val val 
515 520 525 

Leu Gly Arg Asp Ser Ala Thr Leu Arg Ser Gly Leu Gly val val Ala 
530 535 540 

Ser Gly Glu Pro Ala val Asp val val Glu Gly Ser val Leu Asp Gly 
545 550 555 560 

Glu Val val Phe val Phe Pro Gly Gin Gly Trp Gin Trp Ala Gly Met 
565 570 575 

Ala val Asp Leu Leu Asp Ala Ser Pro Thr Phe Ala Arg His Met Asp 
580 585 590 

Glu Cys Ala Thr Ala Leu Arg Arg Tyr Val Asp Trp Ser Leu Val Asp 
595 600 605 

Val Leu Arg Gly Ala Glu Asn ser Pro Pro Leu Asp Arg Val Asp val 
610 615 620 

Leu Gin Pro Ala Ser Phe Ala Val Met Val Ser Leu Ala Glu val Trp 
625 630 635 640 

Arg ser Tyr Gly val Arg Pro Ala Ala val val Gly His Ser Gin Gly 
645 650 655 

Glu lie Ala Ala Ala Cys Ala Ala Gly Val Leu Pro Leu Glu Asp Ala 
660 665 670 

Ala Arg Leu val Ala Leu Arg Ser Arg Ala Leu Lys Gly Leu Ser Gly 
675 680 685 

Arg Gly Gly Met Ala Ser Leu Ala Cys Pro Ala Asp Glu val Ala Ala 
690 695 700 

Leu Phe Ala Gly Ser Gly Gly Arg Leu Glu Val Ala Ala lie Asn Gly 
705 710 715 720 

Pro Arg ser Val Val Val Ser Gly Asp Leu Glu Ala val Asp Glu Leu 
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725 730 735 

Leu Ala Glu Cys Ala Glu Lys Asp Met Arg Ala Arg Arg He Pro val 
740 745 * 750 

Asp Tyr Ala ser His Ser Ala His val Glu val val Arg ser Pro val 
755 760 765 

Leu Ala Ala Ala Ala Gly val Arg His Arg Asp Gly Gin Val Pro Trp 
770 775 ~ 780 

Trp Ser Thr val lie Gly Asp Trp Val Asp Pro Ala Arg Leu Asp Gly 
785 790 795 ~ 800 

Glu Tyr Trp Tyr Arg Asn Leu Arg Gin Pro Val Arg Phe Glu His Ala 
805 810 " 815 

val Gin Gly Leu Val Glu Arg Gly Phe Gly Leu Phe lie Glu Met Ser 
820 " 825 830 

Ala His Pro val Leu Thr Thr Ala Val Glu Glu Thr Gly Ala Glu Ser 
835 840 845 

Glu Thr Ala val Ala Ala val Gly Thr Leu Arg Arg Asp Ser Gly Gly 
850 855 860 

Leu Arg Arg Leu Leu His Ser Leu Ala Glu Ala Tyr val Arg Gly Ala 
865 " 870 875 880 

Thr val Asp Trp Ala val Ala Phe Gly Gly Ala Gly Arg Arg Leu Asp 
885 890 ~ 895 

Leu Pro Thr Tyr Pro Phe Gin Arg Gin Arg Tyr Trp Leu Asp Lys Gly 
900 905 910 

Ala Ala Ser Asp Glu Ala Arg Ala val Ser Asp Pro Ala Ala Gly Trp 
915 920 925 

Phe Trp Gin Ala val Ala Arg Gin Asp Leu Lys Ser val Ser Asp Ala 
930 935 940 

Leu Asp Leu Asp Ala Asp Ala Pro Leu Ser Ala Thr Leu Pro Ala Leu 
945 950 955 960 

Ser val Trp His Arg Gin Glu Arg Glu Arg val Leu Ala Asp Gly Trp 
965 970 975 

Arg Tyr Arg val Asp Trp Val Arg Val Ala Pro Gin Pro val Arg Arg 
980 985 990 

Thr Arg Glu Thr Trp Leu Leu Val val Pro Pro Gly Gly lie Glu Glu 
995 1000 1005 

Ala Leu val Glu Arg Leu Thr Asp Ala Leu Asn Thr Arg Gly lie Ser 
1010 1015 1020 

Thr Leu Arg Leu Asp Val Pro Pro Ala Ala Thr Ser Gly Glu Leu Ala 
025 1030 1035 1040 

Thr Glu Leu Arg Ala Ala Ala Asp Gly Asp Pro Val Lys Ala lie Leu 
1045 1050 1055 

Ser Leu Thr Ala Leu Asp Glu Arg Pro His Pro Glu Cys Lys Asp val 
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1060 1065 1070 

pro Ser Gly lie Ala Leu Leu Leu Asn Leu Val Lys Ala Leu Gly Glu 
1075 1080 1085 

Ala Asp Leu Arg lie Pro Leu Trp Thr lie Thr Arg Gly Ala val Lys 
1090 1095 1100 

Ala Gly Pro Ala Asp Arg Leu Leu Arg Pro Met Gin Ala Gin Ala Trp 
105 1110 " 1115 1120 

Gly Leu Gly Arg Val Ala Ala Leu Glu His Pro Glu Arg Trp Gly Gly 
1125 1130 1135 

Leu lie Asp Leu Pro Asp Ser Leu Asp Gly Asp val Leu Thr Arg Leu 
1140 1145 1150 

Gly Glu Ala Leu Thr Asn Gly Leu Ala Glu Asp Gin Leu Ala lie Arg 
1155 1160 1165 

Gin ser Gly val Leu Ala Arg Arg Leu val Pro Ala Pro Ala Asn Gin 
1170 1175 " 1180 

Pro Ala Gly Arg Lys Trp Arg Pro Arg Gly Ser Ala Leu lie Thr Gly 
185 1190 " 1195 1200 

Gly Leu Gly Ala val Gly Ala Gin val Ala Arg Trp Leu Ala Glu lie 
1205 1210 1215 

Gly Ala Glu Arg lie val Leu Thr ser Arg Arg Gly Asn Gin Ala Ala 
1220 1225 " " 1230 

Gly Ala Ala Glu Leu Glu Ala Glu Leu Arg Ala Leu Gly Ala Gin val 
1235 1240 " 1245 

Ser lie val Ala cys Asp val Thr Asp Arg Ala Glu Met Ser Ala Leu 
1250 1255 1260 

Leu Ala Glu Phe Asp val Thr Ala Val Phe His Ala Ala Gly Val Gly 
265 1270 1275 1280 

Arg Leu Leu Pro Leu Ala Glu Thr Asp Gin Asn Gly Leu Ala Glu lie 
1285 1290 1295 

Cys Ala Ala Lys val Arg Gly Ala Gin val Leu Asp Glu Leu Cys Asp 
1300 1305 1310 

Ser Thr Asp Leu Asp Ala Phe val Leu Phe Ser ser Gly Ala Gly val 
1315 1320 1325 

Trp Gly Gly Gly Gly Gin Gly Ala Tyr Gly Ala Ala Asn Ala Phe Leu 
1330 1335 1340 

Asp Thr Leu Ala Glu Gin Arg Arg Ala Arg Gly Leu Pro Ala Thr Ser 
345 1350 1355 1360 

lie Ser Trp Gly Ser Trp Ala Gly Gly Gly Met Ala Asp Gly Ala Ala 
1365 1370 1375 

Gly Glu His Leu Arg Arg Arg Gly lie Arg Pro Met Pro Ala Ala Ser 
1380 1385 ~ 1390 

Ala lie Leu Ala Leu Gin Glu val Leu Asp Gin Asp Glu Thr cys val 
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1395 1400 1405 

Ser lie Ala Asp val Asp Trp Asp Arg Phe val Pro Thr Phe Ala Ala 
1410 1415 1420 

Thr Arg Ala Thr Arg Leu Phe Asp Glu val Pro Ala Ala Arg Lys Ala 
425 1430 1435 1440 

Met Pro Ala Asn Gly Pro Ala Glu Pro Gly Gly Ser Pro Phe Ala Arg 
1445 1450 1455 

Asn Leu Ala Glu Leu Pro Glu Ala Gin Arg Arg His Glu Leu Val Asp 
1460 1465 ~ 1470 

Leu val cys Ala Gin Val Ala Thr val Leu Gly His Gly Ser Arg Glu 
1475 1480 1485 

Glu val Gin Pro Glu Arg Ala Phe Arg Ala Leu Gly Phe Asp Ser Leu 
1490 1495 1500 

Met Ala val Asp Leu Arg Asn Arg Leu Thr Thr Ala Thr Gly Leu Ar 
505 1510 1515 152 

Leu Pro Thr Thr Thr Val Phe Asp Tyr Pro Asn Pro Ala Ala Leu Ala 
1525 1530 1535 

Ala His Leu Leu Glu Glu Leu Val Gly Asp Val Ala Ser Ala Ala Val 
1540 1545 1550 

Thr Ala Ala Ser Ala Pro Ala Ser Asp Glu Pro lie Ala lie val Ala 
1555 1560 1565 

Met Ser Cys Arg Phe Pro Gly Gly Ala His Ser Pro Glu Asp Leu Trp 
1570 ~ 1575 1580 

Arg Leu Val Ala Ala Gly Thr Glu Val lie Gly Glu Phe Pro Ser Asp 
585 1590 1595 1600 

Arg Gly Trp Asp Ala Glu Gly Leu Tyr Asp Pro Asp Ala Ser Arg Pro 
1605 * 1610 1615 

Gly Thr Thr Tyr Ala Arg Met Ala Gly Phe Leu Tyr Asp Ala Gly Glu 
1620 1625 1630 

Phe Asp Ala Asp Leu Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met 
1635 1640 1645 

Asp Pro Gin Gin Arg Leu Val Leu Glu lie Ala Trp Glu Ala Leu Glu 
1650 1655 1660 

Arg Ala Gly lie Asp Pro Leu Ser Leu Lys Gly Ser Gly Val Gly Thr 
665 1670 1675 1680 

Tyr lie Gly Ala Gly ser Arg Gly Tyr Ala Thr Asp val Arg Gin Phe 
1685 1690 1695 

Pro Glu Glu Ala Glu Gly Tyr Leu Leu Thr Gly Thr Ser Ala Ser Val 
1700 1705 1710 

Leu Ser Gly Arg Val Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val 
1715 ~ 1720 1725 

Thr val Asp Thr Ala Cys ser Ser Ser Leu Val Ala Leu His Leu Ala 
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1730 1735 1740 

Cys Gin Ser Leu Arg Ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly 
745 1750 1755 1760 

val Thr Val Met Ser Thr Pro Glu Met Phe val Glu Phe Ser Arg Gin 
1765 1770 1775 

Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala Glu Ser Ala 
1780 1785 1790 

Asp Gly Thr Gly Trp Gly Glu Gly Ala Gly Leu Leu Leu Leu Glu Arg 
1795 1800 1805 

Leu Ser Asp Ala His Arg Asn Gly His Arg Val Leu Ala Val Val Arg 
1810 1815 " 1820 

Gly Ser Ala val Asn Gin Asp Gly Ala Ser Asn Gly Leu Ala Ala Pro 
825 1830 1835 1840 

Asn Gly Pro Ser Gin Gin Arg Val lie Asn Gin Ala Leu Ala Asn Ala 
1845 1850 1855 

Ala Leu ser Ala Ser Asp val Asp Ala val Glu Ala His Gly Thr Gly 
1860 1865 1870 

Thr Arg Leu Gly Asp Pro lie Glu Ala Gin Ala Leu lie Ala Thr Tyr 
1875 1880 1885 

Gly Gin Ala Arg Glu Arg Asp Arg Pro Leu Trp Leu Gly Ser Val Lys 
1890 1895 1900 

Ser Asn lie Gly His Thr Gin Ala Ala Ala Gly val Ala Gly val lie 
905 1910 1915 1920 

Lys Met val Met Ala Met Arg His Gly Gin Leu Pro Ala Ser Leu His 
1925 1930 1935 

Ala Asp Glu Pro Thr Ser Glu Val Asp Trp Ser Ser Gly Ala Val Arg 
1940 1945 1950 

Leu Leu Ala Glu Gin val Pro Trp Pro Glu Ser Asp Arg Val Arg Arg 
1955 1960 1965 

val Gly Val Ser ser Phe Gly lie Ser Gly Thr Asn Ala His val lie 
1970 1975 1980 

Leu Glu Gin Ala Thr Asn Ala Pro Asp Ser Thr Ala Glu Thr Asp Lys 
985 1990 1995 2000 

Thr Glu Ser Gly Ser Thr val Asp lie Pro val Val Pro Trp Leu val 
2005 2010 2015 

Ser Gly Lys Thr Thr Asp Ser Leu Arg Gly Gin Ala Glu Arg Val Leu 
2020 2025 2030 

Ser Gin val Glu Ser Arg Pro Glu Gin Arg Ser Leu Asp Val Ala Tyr 
2035 2040 2045 

Ser Leu Ala Ser Gly Arg Ala Ala Leu Asp Glu Arg Ala val val Leu 
2050 2055 2060 

Gly Ala Asp Arg Gly Glu Leu Val Ala Gly Leu Ala Ala Leu Ala Ala 
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065 2070 2075 2080 

Gly Gin Glu Ala Ser Gly val lie Ser Gly Thr Arg Ala Ser Ala Arg 
2085 2090 2095 

Phe Gly Phe val Phe Ser Gly Gin Gly Gly Gin Trp Leu Gly Met Gly 
2100 2105 * 2110 

Arg Ala Leu Tyr Ser Lys Phe Pro val Phe Ala Ala Ala Phe Asp Glu 
2115 2120 2125 

Ala Cys Ala Glu Leu Glu Ala His Leu Gly Glu Asp Arg Arg Val Arg 
2130 2135 2140 

Asp Val val Phe Gly Ser Asp Ala Gin Leu Leu Asp Gin Thr Leu Trp 
145 2150 2155 2160 

Ala Gin ser Gly Leu Phe Ala Leu Gin Ala Gly Leu Leu Gly Leu Leu 
2165 2170 2175 

Gly Ser Trp Gly Val Arg Pro Asp Val Val Met Gly His Ser val Gly 
2180 2185 2190 

Glu Leu Ala Ala Ala Phe Ala Ala Gly Val Leu Ser Leu Arg Asp Ala 
2195 2200 2205 

Ala Arg Leu val Ala Ala Arg Ala Arg Leu Met Gin Ala Leu Pro Ser 
2210 2215 2220 

Asp Gly Ala Met Leu Ala Val Ala Ala Gly Glu Asp Leu Val Arg Pro 
225 2230 2235 2240 

Leu Leu Ala Gly Arg Glu Glu Ser val Ser Val Ala Ala Leu Asn Ala 
2245 2250 2255 

Pro Gly Ser val Val Leu Ser Gly Asp Arg Glu val Leu Ala Ser lie 
2260 2265 2270 

Val Gly Arg Leu Thr Glu Leu Arg Val Arg Thr Arg Arg Leu Arg val 
2275 2280 2285 

Ser His Ala Phe His Ser His Arg Met Asp Pro Met Leu Gly Glu Phe 
2290 2295 2300 

Ala Gin lie Ala Glu Ser Ala Glu Phe Gly Lys Pro Thr Thr Pro Leu 
305 2310 2315 2320 

Val Ser Thr Leu Thr Gly Glu Leu Asp Arg Ala Ala Glu Met Ser Thr 
2325 2330 2335 

Pro Gly Tyr Trp val Arg Gin Ala Arg Glu Pro Val Arg Phe Ala Asp 
2340 - 2345 2350 

Gly Val Gin Ala Leu Ala Ala Gin Gly lie Gly Thr val Val Glu Leu 
2355 2360 ' 2365 

Gly Pro Asp Gly Thr Leu Ala Ala Leu Val Arg Glu Cys Ala Thr Glu 
2370 2375 2380 

Ser Asp Arg val Gly Arg lie Ser Ser lie Pro Leu Met Arg Arg Glu 
385 ~ 2390 2395 2400 

Arg Asp Glu Thr Arg Ser Val Met Thr Ala Leu Ala His Leu His Thr 
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2405 2410 2415 

Arg Gly Gly Glu val Asp Trp Gin Ala Phe Phe Ala Gly Thr Gly Ala 
2420 2425 2430 

Arg Gin Leu Glu Leu Pro Thr Tyr Ala Phe Gin Arg Gin His Tyr Trp 
2435 2440 2445 

lie Glu ser ser Ala Arg pro Ala Arg Asp Arg Ala Asp lie Gly Glu 
2450 2455 " 2460 

val Ala Glu Gin Phe Trp Thr Ala Val Asp Gin Gly Asp Leu Ala Thr 
465 2470 2475 2480 

Leu val Ala Ala Leu Asp Leu Gly Ala Asp Asp Asp Thr cys Ala Ser 
2485 2490 2495 

Leu Ser Asp Val Leu Pro Ala Leu Ser Ser Trp Arg Ser Gly Leu Arg 
2500 2505 ' 2510 

Asn Arg ser Leu val Asp Ser Cys Arg Tyr Arg lie Ser Trp His Ser 
2515 2520 2525 

Ser Arg Glu Val Pro Ala Pro Lys lie Ser Gly Thr Trp Leu Leu Val 
2530 2535 2540 

val Pro Gly Ala Ala Asp Asp Gly Leu Val Thr Ala Leu Thr Ser Ser 
545 2550 2555 2560 

Leu val Gly Gly Gly Ala Glu val Val Arg lie Gly Leu Ser Glu Glu 
2565 2570 2575 

Asp Pro His Arg Glu Asp val Ala Gin Arg Leu Ala Asn Ala Leu Thr 
2580 2585 2590 

Asp Ala Gly Gin Leu Gly Gly Val Leu Ser Leu Leu Gly Leu Asp Glu 
2595 2600 2605 

Ser Pro Ala Pro Gly Phe Ser Cys Leu Pro Thr Gly Phe Ala Leu Thr 
2610 2615 2620 

val Gin Leu Leu Arg Ala Leu Arg Lys Ala Asp val Glu Ala Pro Phe 
625 2630 " 2635 2640 

Trp Ala val Thr Arg Gly Gly val Ala Leu Glu Asp val Arg val Ser 
2645 2650 2655 

Pro Glu Gin Ala Leu Val Trp Gly Leu Leu Arg Val Ala Gly Leu Glu 
2660 2665 2670 

His Pro Glu Phe Trp Gly Gly Leu lie Asp Leu Pro Ser Asp Trp Asp 
2675 2680 2685 

Asp Arg Leu Gly Ala Arg Leu Ala Gly val Leu Ala Asp Gly Gly Glu 
2690 2695 2700 

Asp Gin val Ala lie Arg Arg Gly Gly val Phe val Arg Arg Leu Glu 
705 2710 2715 " 2720 

Arg Ala Gly Ala Ser Gly Ala Gly Ser Val Trp Arg Pro Arg Gly Thr 
2725 2730 " 2735 

Val Leu val Thr Gly Gly Thr Gly Gly Leu Gly Ala His Val Ala Arg 
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2740 2745 2750 

Trp Leu Ala Gly Ala Gly Ala Glu His val val Leu Thr ser Arg Arg 
2755 " 2760 2765 

Gly Ala Asp Ala Pro Gly Ala Gly Glu Leu Arg Ala Glu Leu Glu Ala 
2770 2775 2780 

Leu Gly Ala Arg val Ser lie val Pro Cys Asp val Ala Asp Arg Asp 
785 2790 2795 2800 

Ala val Ala Gly val Leu Ala Gly lie Gly Gly Glu Cys Pro Leu Thr 
2805 2810 2815 

Ala val val His Ala Ala Gly val Gly Glu Ala Gly Asp val val Glu 
2820 2825 2830 

Met Gly Leu Ala Asp Phe Ala Ala val Leu Ser Ala Lys val Arg Gly 
2835 2840 2845 

Ala Ala Asn Leu Asp Glu Leu Leu Ala Asp Ser Glu Leu Asp Ala Phe 
2850 2855 2860 

Val Met Phe Ser Ser val Ser Gly Val Trp Gly Ala Gly Gly Gin Gly 
865 2870 2875 2880 

Ala Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Ala Leu Ala Glu Gin Arg 
2885 2890 2895 

Arg Ala Arg Gly Leu val Gly Thr Ala val Ala Trp Gly Pro Trp Ala 
2900 2905 2910 

Gly Asp Gly Met Ala Ala Gly Glu Thr Gly Ala Gin Leu His Arg Met 
2915 2920 2925 

Gly Leu Ala ser Met Glu Pro Ser Ala Ala Leu Leu Ala Leu Gin Gly 
2930 2935 2940 

Ala Leu Asp Arg Asp Glu Thr Ser Leu val val Ala Asp Val Asp Trp 
945 2950 2955 2960 

Ala Arg Phe Ala Pro Ala Phe Thr Ser Ala Arg Arg Arg Pro Leu Leu 
2965 2970 " 2975 

Asp Thr lie Asp Glu Ala Arg Ala Ala Leu Glu Thr Thr Gly Glu Gin 
2980 2985 2990 

Ala Gly Thr Gly Lys Pro Val Glu Leu Thr Gin Arg Leu Ala Gly Leu 
2995 3000 3005 

Ser Arg Lys Glu Arg Asp Asp Ala val Leu Asp Leu val Arg Ala Glu 
3010 3015 3020 

Thr Ala Ala val Leu Gly Arg Asp Asp Ala Thr Ala Leu Ala Pro Ser 
025 3030 3035 3040 

Arg Pro Phe Gin Glu Leu Gly Phe Asp Ser Leu Met Ala val Glu Leu 
3045 3050 3055 

Arg Asn Arg Leu Asn Thr Ala Thr Gly lie Gin Leu Pro Ala Ser Thr 
3060 3065 3070 

lie Phe Asp Tyr Pro Asn Ala Glu ser Leu Ser Arg His Leu Cys Ala 
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3075 3080 3085 

Glu Leu Phe Pro Thr Glu Thr Thr val Asp Ser Ala Leu Ala Glu Leu 
3090 3095 3100 

Asp Arg lie Glu Gin Gin Leu Ser Met Leu Thr Gly Glu Ala Arg Ala 
105 3110 3115 3120 

Arg Asp Arg lie Ala Thr Arg Leu Arg Ala Leu His Glu Lys Trp Asn 
3125 3130 3135 

Ser Ala Ala Glu val Pro Thr Gly Ala Asp val Leu Ser Thr Leu Asp 
3140 3145 3150 

Ser Ala Thr His Asp Glu lie Phe Glu Phe lie Asp Asn Glu Leu Asp 
3155 3160 3165 

Leu Ser 
3170 



<210> 47 
<211> 14775 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 
<221> CDS 

<222> (1).. (14772) 

<223> ORF21; polyketide synthase 

<400> 47 

atg gcc aat gaa gaa aag etc ttc ggc tat ctg aag aag gta act gcg 48 
Met Ala Asn Glu Glu Lys Leu Phe Gly Tyr Leu Lys Lys val Thr Ala 
15 10 15 

gac ctg cat cag acc egg cag cgc ctg etc gcg gcc gag age egg agt 96 
Asp Leu His Gin Thr Arg Gin Arg Leu Leu Ala Ala Glu Ser Arg Ser 
20 25 30 

cag gag ccg ate gcg ate gtc teg gcg age tgc cga ctg ccc ggc ggc 144 
Gin Glu Pro lie Ala lie val Ser Ala Ser cys Arg Leu Pro Gly Gly 
35 40 45 



gtc gac tct ccc gaa gcg etc tgg caa etc gtg cgc act ggc acc gac 192 
Val Asp Ser Pro Glu Ala Leu Trp Gin Leu val Arg Thr Gly Thr Asp 
50 55 60 



gcc ate teg gag ttc ccc gcc gac egg ggc tgg gat etc ggc egg ttg 240 

Ala lie Ser Glu Phe Pro Ala Asp Arg Gly Trp Asp Leu Gly Arg Leu 
65 70 75 80 

tac gat ccc gac ccg aac cac cag gga acg teg tac acg egg gcc ggc 288 

Tyr Asp Pro Asp Pro Asn His Gin Gly Thr Ser Tyr Thr Arg Ala Gly 
85 90 95 

ggt ttc etc gca gga gcg ggc gat ttc gac ccc gcc atg ttc ggg att 336 

Gly Phe Leu Ala Gly Ala Gly Asp Phe Asp Pro Ala Met Phe Gly lie 

100 105 110 

teg ccg cgt gag gcg ttg gcg atg gac ccg cag caa egg ttg ttg ctg 384 

Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
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115 120 125 

gag ctg tec tgg gag gec etc gaa egg gcg ggc ata gac ccg aca tec 
Glu Leu ser Trp Glu Ala Leu Glu Arg Ala Gly lie Asp Pro Thr Ser 
130 135 ~ 140 



432 



ctg cgc ggc age aag acc ggt gtc ttc ggt ggt gtc acg ccc cag gag 480 

Leu Arg Gly ser Lys Thr Gly Val Phe Gly Gly val Thr Pro Gin Glu 

145 150 155 160 

tac ggg ccg tec ttg cag gag atg age cga aac get ggg gqt ttt gqa 528 

Tyr Gly Pro Ser Leu Gin Glu Met Ser Arg Asn Ala Gly Gly Phe Gly 
165 170 175 

etc acc gqg egg atg gtg agt gtg gcg teg gqt egg gtt gcg tat teg 576 

Leu Thr Gly Arg Met Val Ser val Ala Ser Gly Arg val Ala Tyr Ser 

180 185 ~ 190 

ttt gqt ttt gag gqt cct gcg gtq acg gtq gat acg gcg tgt teg teg 

Phe Gly Phe Glu Gly Pro Ala val Thr val Asp Thr Ala Cys Ser Ser 

195 200 205 

teg ttg gtq gee ctg cat ttg gcg tgt cag teg ttg cgt tec ggc gaa 

Ser Leu val Ala Leu His Leu Ala Cys Gin Ser Leu Arg ser Gly Glu 
210 215 220 



624 



672 



tgc gat etc gcg ctg gec ggc ggt gtg acg gtg atg gcg aca ccg gcg 720 

Cys Asp Leu Ala Leu Ala Gly Gly Val Thr val Met Ala Thr Pro Ala 

225 230 235 240 

acg ttc gtg gag ttc tec cgt cag cgt ggt ttg get ccg gac ggg egg 768 

Thr Phe val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg 
245 ~ 250 255 

tgc aag teg ttc gcg get gec gcg gat ggc acc ggg tgg ggt gag ggt 

Cys Lys ser Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly 
260 265 270 



816 



gec ggt ctg gtq ttg ctg gag egg ttg teg gat gcg egg egg aat ggg 864 

Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly 

275 280 285 

cac gag gtt ctg gcg gtg gtg egg ggt age gcg gtg aac cag gac ggc 912 

His Glu val Leu Ala val Val Arg Gly Ser Ala val Asn Gin Asp Gly 
290 295 300 

gcg teg aat ggt ttg act gcg ccg aat ggt ccg teg cag cag egg gtg 960 

Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin Arg Val 

305 310 315 320 

ate acc cag gcg ttg gcg agt gcg ggg ctg teg gtt tec gat gtg gat 1008 

lie Thr Gin Ala Leu Ala Ser Ala Gly Leu Ser val Ser Asp val Asp 
325 330 335 

gcg gtc gag gca cat ggg acc ggg acc acg ttg ggt gat ccg ate gag 1056 

Ala val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro lie Glu 

340 345 350 

gca cag gee ctg ate gec acg tac ggg cag ggc egg gag aag gat egg 

Ala Gin Ala Leu lie Ala Thr Tyr Gly Gin Gly Arg Glu Lys Asp Arg 

355 360 365 

ccg ttg tgg ttg ggg teg gtc aag tec aac ate ggt cac acg cag gcg 1152 
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pro Leu Trp Leu Gly Ser val Lys Ser Asn lie Gly His Thr Gin Ala 
370 375 380 

gcc get gqc gtt gec gqc gtc ate aag atg gtc ttg gcg atg egg cac 1200 
Ala Ala Gly val Ala Gly val lie Lys Met val Leu Ala Met Arg His 
385 390 395 400 

gqg cag ctg ccc gcc acg ttg cat gtq gat gag ccc acg teg gcg gtq 1248 
Gly Gin Leu Pro Ala Thr Leu His val Asp Glu Pro Thr Ser Ala val 
405 410 415 

gac tgg teg gcg gqt teg gtc egg ctt etc acg gag aac acg ccc tgg 1296 
Asp Trp Ser Ala Gly Ser Val Arg Leu Leu Thr Glu Asn Thr Pro Trp 
420 425 430 



ccg gac agt gqt cgt cct tgc egg gtq gqg gtq teg teg ttc gqg ate 
Pro Asp Ser Gly Arg Pro Cys Arg Val Gly val Ser Ser Phe Gly lie 
435 440 445 



gtc ccc gtq gtq cct tgg gtq ctg teg gqt aag aca ccg gag get gcg 
val Pro val val Pro Trp val Leu Ser Gly Lys Thr Pro Glu Ala Ala 
485 490 495 

egg gcg cag gcc gaa egg gtq cat teg cat ate gag gac egg ccg gqg 
Arg Ala Gin Ala Glu Arg val His Ser His lie Glu Asp Arg Pro Gly 
500 505 510 

ctg teg ccg gtq gat gtq gcg tat teg eta gqa atg aca cgc gcg gcg 
Leu Ser Pro Val Asp val Ala Tyr ser Leu Gly Met Thr Arg Ala Ala 
515 520 525 

ctg gat gaa cgc gca gtq gtq ttg gqc teg gac cgt gcc gcg etc ctg 
Leu Asp Glu Arg Ala val Val Leu Gly Ser Asp Arg Ala Ala Leu Leu 
530 535 540 



cag gqt gqt cag tgg ccg gqg atg gqc egg gqg etc tac teg gtq ttt 
Gin Gly Gly Gin Trp Pro Gly Met Gly Arg Gly Leu Tyr Ser Val Phe 
580 585 ~ 590 



cac ctg gqc cag gaa ctg egg gtt egg gat gtq gtq ttc gqt teg caa 
His Leu Gly Gin Glu Leu Arg val Arg Asp Val Val Phe Gly Ser Gin 
610 615 620 
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age gqc acc aac gca cat gtq att etc gaa cag tct cca gtc gag cag 1392 

Ser Gly Thr Asn Ala His val lie Leu Glu Gin Ser Pro val Glu Gin 
450 455 460 

gqc gaa ccg gcc gqg ccg gtc gaa gqc gag egg gaa ccg gat gta gcc 1440 

Gly Glu Pro Ala Gly Pro val Glu Gly Glu Arg Glu Pro Asp Val Ala 

465 470 475 480 



1488 



1536 



1584 



1632 



acc gqg ttg agg gca ttc gcc gac gqc tgc gat gcg ccc gaa gtq gtt 1680 
Thr Gly Leu Arg Ala Phe Ala Asp Gly Cys Asp Ala Pro Glu Val Val 
545 550 555 560 

teg gqg tct gtq gqg ctt gqt gqc cgc gtc gqg ttc gtq ttc teg gqt 1728 
Ser Gly Ser val Gly Leu Gly Gly Arg val Gly Phe Val Phe Ser Gly 
565 570 575 



1776 



ccg gtq ttc gcc gac gcg ttc gac gag get tgc gcg gag ttg gat gca 1824 
pro val Phe Ala Asp Ala Phe Asp Glu Ala Cys Ala Glu Leu Asp Ala 
595 600 605 



1872 
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gcg tgg ttg ctg gat egg acg gtq tgg gcg cag teg gqt ttg ttc gcg 1920 
Ala Trp Leu Leu Asp Arg Thr Val Trp Ala Gin Ser Gly Leu Phe Ala 
625 630 635 640 

ttg cag att gqc ttg ctg egg ctg ctg gqt teg tgg gqt gtt egg ccg 
Leu Gin He Gly Leu Leu Arg Leu Leu Gly Ser Trp Gly val Arg Pro 
645 ~ 650 655 

gat gtq gtq ttg gqg cac teg gtq gqt gag ctg get gcg gtq cat gcg 
Asp val val Leu Gly His Ser val Gly Glu Leu Ala Ala val His Ala 
660 665 670 



gee egg ttg atg cag gcg ttg cct tct gqt gqt gee atg etc gcg gtc 
Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Ala Met Leu Ala Val 
690 695 700 



egg ate gqt ate gcg gcg gtq aat gqc ccg gaa teg gtt gtq etc tct 
Arg lie Gly lie Ala Ala val Asn Gly Pro Glu Ser val Val Leu Ser 
725 730 735 



gaa tat cac gca ccg gaa ctg ccg ate ate teg acc ctg ate gqt gag 
Glu Tyr His Ala Pro Glu Leu Pro lie lie Ser Thr Leu lie Gly Glu 
785 790 795 800 



1968 



2016 



get gqt gtq ttg teg ttg teg gag gee gcg egg ttg gtq gcg gqt cgc 2064 
Ala Gly val Leu Ser Leu Ser Glu Ala Ala Arg Leu Val Ala Gly Arg 
675 680 ^ 685 



2112 



get acg gqt gag ttt cag gtc gat cct ctg ctg gat gqg gtq egg gac 2160 
Ala Thr Gly Glu Phe Gin Val Asp Pro Leu Leu Asp Gly val Arg Asp 
705 710 715 720 



2208 



gqt gac cgc gag ctg etc acc gag ate get gat egg ttg cac gat cag 2256 

Gly Asp Arg Glu Leu Leu Thr Glu lie Ala Asp Arg Leu His Asp Gin 

740 745 ~ 750 

gqg tgc egg acc egg tgg ttg egg gtq teg cat get ttc cat teg ccc 2304 

Gly Cys Arg Thr Arg Trp Leu Arg val Ser His Ala Phe His Ser Pro 

755 760 765 

cat atg gag ccg atg ctg gag gag ttc gee cag ate tec cga gqc cgc 2352 

His Met Glu Pro Met Leu Glu Glu Phe Ala Gin lie Ser Arg Gly Arg 

770 775 780 



2400 



ctg gac gqt gqt cga gtq atg gqc act ccc gag tac tgg gtq cgt cag 2448 

Leu Asp Gly Gly Arg Val Met Gly Thr Pro Glu Tyr Trp val Arg Gin 

805 810 815 

gtq cgt gag ccc gtc cgt ttc gee gag gqt gtc cag gcg ctt gtc gqt 2496 

val Arg Glu Pro val Arg Phe Ala Glu Gly val Gin Ala Leu val Gly 
820 ~ 825 830 

cag gqt gtc gqc acg att gtc gaa ttg gqt ccg gac gqg gcg ttg teg 2544 

Gin Gly val Gly Thr lie val Glu Leu Gly Pro Asp Gly Ala Leu Ser 
835 840 845 

acg ttg gtc gag gag tgt gtq gcg gaa tec gqg egg gtq gec gqg ate 

Thr Leu val Glu Glu cys Val Ala Glu Ser Gly Arg val Ala Gly lie 
850 855 860 



2592 



ccg ctg atg cgc aag gac cgc gac gag gcg cga acc gtq ctg gca get 2640 

Pro Leu Met Arg Lys Asp Arg Asp Glu Ala Arg Thr Val Leu Ala Ala 
865 870 "* 875 880 
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ttg gcg cag ate cac acc cgt ggt ggt gag gtg gac tgg egg teg ttt 2688 
Leu Ala Gin He His Thr Arg Gly Gly Glu Val Asp Trp Arg Ser Phe 
885 890 895 

ttc gee gqt acc ggg gcg aag caa gtc gac ctg ccc acc tac gee ttc 2736 
Phe Ala Gly Thr Gly Ala Lys Gin Val Asp Leu Pro Thr Tyr Ala Phe 
900 905 910 

cag egg cag egg tac tgg ctg gca tec acc gqg cgt gcg gqt gac gtq 2784 
Gin Arg Gin Arg Tyr Trp Leu Ala Ser Thr Gly Arg Ala Gly Asp val 
915 ~ 920 925 

acc gee gee gga ttg gee gag gcg gac cat ccg ctg etc gqt gcg gtq 2832 
Thr Ala Ala Gly Leu Ala Glu Ala Asp His Pro Leu Leu Gly Ala Val 
930 935 940 

gtt gcg ttg gca gac ggc gaa gqt gtq gtq ctg acc gqt egg ttg aca 2880 
val Ala Leu Ala Asp Gly Glu Gly Val Val Leu Thr Gly Arg Leu Thr 
945 950 955 w 960 

gcg gqt teg cat ccg tgg ttg tec gat cac egg gtq ctg gqc gaa ate 2928 
Ala Gly ser His Pro Trp Leu ser Asp His Arg val Leu Gly Glu lie 
965 970 975 

gtc gtc ccc gqc acc gcg ate gtc gag ctg gtq tgg cac gtc gqc gag 2976 
val val Pro Gly Thr Ala lie Val Glu Leu Val Trp His val Gly Glu 
980 985 990 

cgc etc gqt tgt gqc egg gtq gaa gaa ctg get ttg gaa gcg ccc ctg 3024 
Arg Leu Gly Cys Gly Arg Val Glu Glu Leu Ala Leu Glu Ala Pro Leu 
995 "* 1000 1005 

ate ctg ccg gat cat gga gcg gtc cag gtt cag gtq ctg gtq gga ccg 3072 
lie Leu Pro Asp His Gly Ala val Gin Val Gin Val Leu val Gly Pro 
1010 1015 1020 

ccc gqg gaa tec gga gee egg teg gtg gcg etc tac tec tgt cct ggc 3120 
Pro Gly Glu Ser Gly Ala Arg ser val Ala Leu Tyr ser Cys Pro Gly 
1025 1030 1035 * 1040 

gag gcg ate gaa ccc gag tgg aag aag cac gcg acg ggc gtg ctt etc 3168 
Glu Ala lie Glu Pro Glu Trp Lys Lys His Ala Thr Gly Vat Leu Leu 
1045 1050 1055 

cca ccc gtq gee gee gag aac cat gag ctg acc gca tgg ccc ccg gag 3216 
Pro Pro val Ala Ala Glu Asn His Glu Leu Thr Ala Trp Pro Pro Glu 
1060 1065 1070 

aat gcg acc gaa ate gat gca gac ggg gtc tac gca ttc ctt gaa ggg 3264 
Asn Ala Thr Glu lie Asp Ala Asp Gly Val Tyr Ala Phe Leu Glu Gly 
1075 1080 1085 

cac ggt ttc gcg tac gga ccg gee ttt aga tgt ctg cgc ggt gee tgg 3312 
His Gly Phe Ala Tyr Gly Pro Ala Phe Arg Cys Leu Arg Gly Ala Trp 
1090 1095 1100 

cga cga ggc ggg gag gtg ttc gec gaa gtc gca ttg ccg gat gac atg 3360 
Arg Arg Gly Gly Glu val Phe Ala Glu val Ala Leu Pro Asp Asp Met 
1105 1110 1115 1120 

cag gcg ggg gtc gat cga ttc ggc gtc cac ccc gcg ttg ctg gac gcg 3408 
Gin Ala Gly val Asp Arg Phe Gly Val His Pro Ala Leu Leu Asp Ala 
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1125 1130 1135 

gtt ctg cat gcc gcc gca gcc gag acg teg gtq gtc cag age gaa gcg 3456 
val Leu His Ala Ala Ala Ala Glu Thr Ser val Val Gin Ser Glu Ala 
1140 1145 1150 

egg gtq ccg ttc teg tgg cgt gqg gtq gaa ctt cgc gcc act gaa age 
Arg val Pro Phe Ser Trp Arg Gly val Glu Leu Arg Ala Thr Glu Ser 
1155 1160 1165 



ctg gtq ace cga ccg ate tec egg cag cag gtq agg tct gqc gcg ate 
Leu val Thr Arg Pro lie Ser Arg Gin Gin Val Arg Ser Gly Ala lie 
1205 1210 " 1215 

gqt gat tgc ctg ttc gag gtq gag tgg cac egg aag gcg ttg ttg gqa 
Gly Asp cys Leu Phe Glu Val Glu Trp His Arg Lys Ala Leu Leu Gly 
1220 1225 1230 



ccg gaa teg gtq cgc gca acc gca egg ttc gcg ace ctg gat gag ttc 
Pro Glu ser val Arg Ala Thr Ala Arg Phe Ala Thr Leu Asp Glu Phe 
1250 1255 1260 

cgt gcg gcc gtq gac teg gac gtt cct gcc ccg gqt teg gtq ttg gtc 
Arg Ala Ala val Asp ser Asp val Pro Ala Pro Gly ser val Leu val 
1265 1270 1275 1280 

gca get atg teg gcc gaa gag gtc gag gqt gqa tec ctg ccg teg cgc 
Ala Ala Met Ser Ala Glu Glu Val Glu Gly Gly ser Leu Pro Ser Arg 
1285 1290 1295 

gcc caa gag teg acc tec gat ctg ctg get etc gtq cag teg tgg ctt 
Ala Gin Glu Ser Thr ser Asp Leu Leu Ala Leu Val Gin Ser Trp Leu 
1300 1305 1310 



3504 



gcg gtq gtq egg gcg cgc etc teg ttg act teg gat gac gaa ctg teg 3552 
Ala val val Arg Ala Arg Leu ser Leu Thr Ser Asp Asp Glu Leu Ser 
1170 1175 1180 

ttg gtc gca gtq gac ccg get gqc cga ttc gtq gcc acg gtt gat teg 3600 
Leu val Ala val Asp Pro Ala Gly Arg Phe Val Ala Thr val Asp Ser 
1185 1190 1195 1200 



3648 



3696 



aca acc gcc gqc gac gac ctt gcc ate gtc gqt gac gqt ccc agt tgg 3744 
Thr Thr Ala Gly Asp Asp Leu Ala lie Val Gly Asp Gly Pro Ser Trp 
1235 1240 1245 



3792 



3840 



3888 



3936 



gcg gac gag egg ttc gcc gaa tec cag etc gtq gtc gtc acg cgt gca 3984 
Ala Asp Glu Arg Phe Ala Glu Ser Gin Leu val val Val Thr Arg Ala 
1315 1320 1325 

gcg gtq teg gcc gac teg gat teg gac gtc gcg gac ctg gtq gqt gcg 4032 
Ala val ser Ala Asp Ser Asp Ser Asp Val Ala Asp Leu val Gly Ala 
1330 1335 1340 

teg teg tgg gqg ttg ttg agt tea gcc cag teg gag aac ccg gqt cgc 4080 
Ser Ser Trp Gly Leu Leu Ser Ser Ala Gin Ser Glu Asn Pro Gly Arg 
1345 1350 1355 1360 

ttc gtq ctg gtq gac gtq gac gqc aca cct gag teg tgg cag gcg ttg 4128 
Phe val Leu val Asp val Asp Gly Thr Pro Glu Ser Trp Gin Ala Leu 
1365 1370 1375 

ccg gcc gcc gtg cga gca gga gaa ccg cag ctg gca ctt egg cgc ggc 4176 
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Pro Ala Ala Val Arg Ala Gly Glu Pro Gin Leu Ala Leu Arg Arg Gly 

1380 1385 1390 

gtg gcg ctg gtg cct egg ttg gcg cga etc acg gtg cgc gag gag ggc 4224 

val Ala Leu val Pro Arg Leu Ala Arg Leu Thr vaT Arg Glu Glu Gly 
1395 ~ 1400 1405 

tec tec ccg caa etc gac acg gac ggg acc gtc etc ate acg ggt gqc 4272 

Ser Ser Pro Gin Leu Asp Thr Asp Gly Thr Val Leu lie Thr Gly Gly 
1410 1415 1420 

acc gqt gcg ttg ggg gga gtq gtt gee cgt cac ctg gtq gag gag cac 4320 

Thr Gly Ala Leu Gly Gly val val Ala Arg His Leu val Glu Glu His 

1425 1430 1435 1440 

ggg att egg cgt ttg gtq ttg gca gqc egg cgt gqc tgg aat gcg cct 4368 

Gly lie Arg Arg Leu Val Leu Ala Gly Arg Arg Gly Trp Asn Ala Pro 

1445 1450 "* 1455 

gqa gtc cac gag ttg gtq gat gag ctg gcg cgc gcg gqc gee gtq gtt 4416 

Gly Val His Glu Leu val Asp Glu Leu Ala Arg Ala Gly Ala val Val 

1460 1465 1470 

gag gtq gtq get tgc gat gtq get gac cgc acc gat ctg gag cac gtq 4464 

Glu val val Ala cys Asp val Ala Asp Arg Thr Asp Leu Glu His val 
1475 1480 1485 

ctg gec gec att ccg gtc gac tgg ccg ctg egg ggg ate gtg cat acc 4512 

Leu Ala Ala lie Pro val Asp Trp Pro Leu Arg Gly lie Val His Thr 
1490 1495 1500 

get ggg gtg ctg gee gac gga gtg ate ggg tec ttg teg gcg gcg gat 4560 

Ala Gly Val Leu Ala Asp Gly Val lie Gly Ser Leu Ser Ala Ala Asp 

1505 1510 1515 1520 

gtq gqc acg gtq ttt gee ccg aag gtq acg ggg gca tgg cat ctg cac 4608 

val Gly Thr Val Phe Ala Pro Lys Val Thr Gly Ala Trp His Leu His 

1525 1530 1535 

gag ttg acc cgc gat ctg gat ctg teg ttc ttc gtt ctt ttc tct tec 4656 

Glu Leu Thr Arg Asp Leu Asp Leu Ser Phe Phe Val Leu Phe Ser Ser 

1540 1545 1550 

ttc tec ggg att gcg ggt gee gca ggg cag gec aac tac gcg gcg gcg 4704 

Phe Ser Gly lie Ala Gly Ala Ala Gly Gin Ala Asn Tyr Ala Ala Ala 
1555 1560 1565 

aac acg ttc ctg gat gca ttg gcg cgt tat cgc egg gcg cgt ggg ctg 4752 

Asn Thr Phe Leu Asp Ala Leu Ala Arg Tyr Arg Arg Ala Arg Gly Leu 
1570 1575 1580 

cct ggg ttg teg ttg gcg tgg gga ctg tgg gcg caa ccc age ggt atg 4800 

Pro Gly Leu Ser Leu Ala Trp Gly Leu Trp Ala Gin Pro Ser Gly Met 

1585 1590 1595 1600 

acg agt ggc ttg gac gcg gcg teg gtg gag egg ttg gcg egg acg ggc 4848 

Thr Ser Gly Leu Asp Ala Ala Ser Val Glu Arg Leu Ala Arg Thr Gly 

1605 1610 1615 

ate gca gaa ctt tec acg gag gat gga etc cgc ctg ttc gat gec gcg 4896 

lie Ala Glu Leu ser Thr Glu Asp Gly Leu Arg Leu Phe Asp Ala Ala 

1620 1625 1630 
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ttc gcg aag gac egg get tgc gtc gtt gec get cga ttg gac agg gcg 4944 
Phe Ala Lys Asp Arg Ala Cys Val Val Ala Ala Arg Leu Asp Arg Ala 
1635 ~ 1640 1645 



ctg ctg gtc gqg aac gga cga teg cac gcg att ccg gcg ctg ttg age 

Leu Leu val Gly Asn Gly Arg ser His Ala lie Pro Ala Leu Leu Ser 

1650 1655 1660 

gcg ttg gtt cct gtt cgc gqc gqt gtq gcg agg aaa aca gec aat tct 

Ala Leu val Pro val Arg Gly Gly val Ala Arg Lys Thr Ala Asn Ser 

1665 1670 1675 1680 



teg gee gtq ctg gqt tat teg gqt gcg gtc gag gtt gqg gqc gac cgt 
Ser Ala Val Leu Gly Tyr Ser Gly Ala val Glu Val Gly Gly Asp Arg 
1700 1705 1710 

get ttc cgt gat ctg gqt ttt gat teg ttg tct gqc gtq gag ttg egg 
Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Ser Gly val Glu Leu Arg 
1715 1720 1725 



ctg gca gqc gag gtc gcg tec acg teg acg ccg gtq acc agg gca gcg 
Leu Ala Gly Glu Val Ala Ser Thr Ser Thr Pro Val Thr Arg Ala Ala 
1765 1770 1775 



ccg gqt gqg gtq teg teg ccg gag gag ctt tgg egg ctg gtq gee gqc 
Pro Gly Gly val Ser ser Pro Glu Glu Leu Trp Arg Leu Val Ala Gly 
1795 1800 1805 

gqc gtq gat gcg gtq get gqg ttc cca gac gat cgc gqc tgg gat etc 
Gly Val Asp Ala val Ala Gly Phe Pro Asp Asp Arg Gly Trp Asp Leu 
1810 1815 1820 



tgt gag gqc gqg ttt ctg egg gac gcg gcg gag ttc gat get gac atg 
Cys Glu Gly Gly Phe Leu Arg Asp Ala Ala Glu Phe Asp Ala Asp Met 
1845 1850 1855 



ttg ctg ctg gag gtc gec tgg gaa acc ttg gag egg get gqg ate gat 
Leu Leu Leu Glu val Ala Trp Glu Thr Leu Glu Arg Ala Gly lie Asp 
1875 1880 1885 
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4992 



5040 



cag gee gcg gat gag gac gca ctg ttg gqt ttg gtq egg gag cac gtt 5088 
Gin Ala Ala Asp Glu Asp Ala Leu Leu Gly Leu val Arg Glu His val 
1685 1690 1695 



5136 



5184 



aac cgc ctt gee ggg gtq ctg gqg gtq egg ttg ccg gcg act gcg gtq 5232 
Asn Arg Leu Ala Gly val Leu Gly val Arg Leu Pro Ala Thr Ala Val 
1730 1735 1740 

ttc gac tat ccg acg ccg egg gcg ctg gcg cgt ttc ctg cat cag gaa 5280 
Phe Asp Tyr Pro Thr Pro Arg Ala Leu Ala Arg Phe Leu His Gin Glu 
1745 1750 1755 1760 



5328 



agt gee gaa gag gat ctt gtt gcg att gtc gqg atg gga tgt cgt ttt 5376 
Ser Ala Glu Glu Asp Leu Val Ala lie val Gly Met Gly Cys Arg Phe 
1780 1785 1790 



5424 



5472 



gcg gcg ttg tac gat cct gat ccc gat cgt etc gqg acc teg tat gtq 5520 
Ala Ala Leu Tyr Asp Pro Asp Pro Asp Arg Leu Gly Thr Ser Tyr val 
1825 1830 1835 1840 



5568 



ttc gqc ate age ccg cgt gag gcg ttg gcg atg gat ccg cag cag egg 5616 
Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
1860 " 1865 1870 



5664 
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ccg ttc teg ttg cac ggc age egg ace ggt gtg ttc gcg ggc ttg atg 5712 

Pro Phe Ser Leu His Gly Ser Arg Thr Gly val Phe Ala Gly Leu Met 
1890 1895 1900 

tac cac gac tat ggg gec cga ttc att acc aga gca ccg gag gqc ttc 5760 

Tyr His Asp Tyr Gly Ala Arg Phe lie Thr Arg Ala Pro Glu Gly Phe 
1905 1910 1915 1920 

gaa gqg cac etc gqg acg gqc aat gcg gqg age gtq ctg teg ggt egg 5808 

Glu Gly His Leu Gly Thr Gly Asn Ala Gly Ser Val Leu Ser Gly Arg 
1925 1930 1935 

gtt gcg tat teg ttt ggt ttc gag ggt cct gcg gtg acg gtg gat acg 5856 

val Ala Tyr ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr 
1940 1945 1950 

gcg tgt teg teg teg ttg gtq gcg tta cac ctg gcg ggt caa gca ctg 5904 

Ala Cys ser Ser ser Leu Val Ala Leu His Leu Ala Gly Gin Ala Leu 
1955 1960 1965 

egg gee ggt gag tgc gaa ttc gee ctt gee ggt ggc gtc acg gtg atg 5952 

Arg Ala Gly Glu Cys Glu Phe Ala Leu Ala Gly Gly val Thr Val Met 
1970 - 1975 1980 

teg acg ccg acg acg ttc gtg gag ttc tec cgt caa egg ggt ctg get 6000 

ser Thr Pro Thr Thr Phe val Glu Phe Ser Arg Gin Arg Gly Leu Ala 
1985 1990 1995 2000 

ccg gat ggg egg tgc aag teg ttc gcg gcg gee gcg gat ggc acc ggg 6048 

Pro Asp Gly Arg Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly 
2005 2010 2015 

tgg ggc gag ggt gee ggt ctg gtg ttg ctg gag egg ttg teg gat gee 6096 

Trp Gly Glu Gly Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala 
2020 2025 2030 

egg cgc aat ggg cac gag gtt ctg gcg gtg gtg egg ggt age gcg gtg 6144 

Arg Arg Asn Gly His Glu val Leu Ala val val Arg Gly Ser Ala val 
2035 2040 2045 

aac cag gac ggc gcg teg aat ggc ttg act gcg cca aat ggt ccg tea 6192 

Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro ser 
2050 2055 2060 

cag caa agg gtg ate acc cag gca etc acg agt gee ggg ctg tec gtg 6240 

Gin Gin Arg val lie Thr Gin Ala Leu Thr Ser Ala Gly Leu Ser Val 
2065 ~ 2070 2075 2080 

tec gac gtg gat get gtg gag gcg cat ggg acg ggc acg egg ctt ggt 6288 

Ser Asp val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly 
2085 2090 2095 

gat ccg ate gag gcg cag gcg ttg ate get acg tac ggc egg gat cgt 6336 

Asp Pro lie Glu Ala Gin Ala Leu lie Ala Thr Tyr Gly Arg Asp Arg 
2100 2105 2110 

gat ccc ggt egg ccg ttg tgg ctg ggg teg gtg aag teg aat att ggt 6384 

Asp Pro Gly Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn lie Gly 
2115 2120 2125 

cac acc cag gcg gcg gcg ggt gtc get ggt gtg ate aag atg gtg atg 6432 

His Thr Gin Ala Ala Ala Gly val Ala Gly Val lie Lys Met val Met 
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2130 2135 2140 

gcg atg egg cag ggg gag ctg ccg cgc acg ttg cac gtg gac gag ccc 6480 
Ala Met Arg Gin Gly Glu Leu Pro Arg Thr Leu His Val Asp Glu Pro 
2145 2150 2155 2160 

tec gcg cag gtg gac tgg tct gcg ggc acg gtc caa etc etc acg gag 6528 
Ser Ala Gin val Asp Trp Ser Ala Gly Thr Val Gin Leu Leu Thr Glu 
2165 2170 2175 

aac acg ccc tgg ccc gac age ggt cgt ctt cgc egg gcg gqc gtg tea 6576 
Asn Thr pro Trp Pro Asp Ser Gly Arg Leu Arg Arg Ala Gly val Ser 
2180 2185 "* 2190 

teg ttc ggg ate agt ggc acc aac gcg cac ctg ate ctt gaa caa cct 6624 
Ser Phe Gly lie Ser Gly Thr Asn Ala His Leu lie Leu Glu Gin Pro 
2195 2200 2205 

ccg cga gag teg cag cgc tea aca gag ccg gat teg ggt tct gtc cgc 6672 
Pro Arg Glu Ser Gin Arg Ser Thr Glu Pro Asp Ser Gly Ser Val Arg 
2210 2215 2220 

gat ttt ccg gtg gtg ccg tgg atg gtg teg ggc aaa aca ccc gaa gcg 6720 
Asp Phe Pro val val Pro Trp Met val Ser Gly Lys Thr Pro Glu Ala 
2225 2230 2235 2240 

eta tec gee cag gca gat gca ttg atg tec tac ttg age aat cgc gtt 6768 
Leu Ser Ala Gin Ala Asp Ala Leu Met Ser Tyr Leu Ser Asn Arg Val 
2245 2250 2255 

gat get tec ccg cga gat ate ggt tat teg ctt gcg gtg acc cgt ccg 6816 
Asp Ala Ser Pro Arg Asp lie Gly Tyr Ser Leu Ala Val Thr Arg Pro 
2260 2265 2270 

gcg ttg gac cac cgc get gtc gtg ctg ggt gcg gat cgt gec gcg ttg 6864 
Ala Leu Asp His Arg Ala Val Val Leu Gly Ala Asp Arg Ala Ala Leu 
2275 2280 2285 

ctg ccg ggc ttg aaa gcg ctg gec gtt agt aat gac get gee gag gtg 6912 
Leu Pro Gly Leu Lys Ala Leu Ala Val Ser Asn Asp Ala Ala Glu Val 
2290 2295 2300 

ate acc ggc act cgt gee get ggg ccg gtc gga ttc gtg ttc tec ggt 6960 
lie Thr Gly Thr Arg Ala Ala Gly Pro val Gly Phe Val Phe Ser Gly 
2305 2310 2315 2320 

caa ggt ggt cag tgg ccc ggg atg gga age ggg etc cac teg gcg ttt 7008 
Gin Gly Gly Gin Trp Pro Gly Met Gly Ser Gly Leu His Ser Ala Phe 
2325 2330 2335 

ccg gtg ttc gee gac gcg ttt gac gaa gec tgc tgc gag ctg gat gcg 7056 
Pro val Phe Ala Asp Ala Phe Asp Glu Ala Cys Cys Glu Leu Asp Ala 
2340 2345 2350 

cat etc ggg cag atg gec egg eta cga gat gtg ttg tec ggt teg gat 7104 
His Leu Gly Gin Met Ala Arg Leu Arg Asp Val Leu Ser Gly ser Asp 
2355 2360 2365 

acg caa ctt ctg gac cag acc ttg tgg gcg cag ccg ggc ctg ttc gcg 7152 
Thr Gin Leu Leu Asp Gin Thr Leu Trp Ala Gin Pro Gly Leu Phe Ala 
2370 2375 2380 

ttg caa gtc gga etc tgg gag ttg ttg ggt teg tgg ggt gtc egg ccc 7200 
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Leu Gin val Gly Leu Trp Glu Leu Leu Gly Ser Trp Gly Val Arg Pro 
2385 2390 2395 2400 

get gtg gtg ctg ggc cac teg gtc ggt gag ctg gcg gcg gcg ttc gcg 7248 
Ala val val Leu Gly His Ser val Gly Glu Leu Ala Ala Ala Phe Ala 
2405 2410 2415 

get gga gtq ttg teg ttg egg gat gcg get egg ctg gtq gcg gqc cgt 7296 
Ala Gly val Leu Ser Leu Arg Asp Ala Ala Arg Leu val Ala Gly Arg 
2420 " 2425 2430 

gec egg ttg atg caa gee ctg cca act gqc ggt gec atg etc get gcg 7344 
Ala Arg Leu Met Gin Ala Leu Pro Thr Gly Gly Ala Met Leu Ala Ala 
2435 2440 2445 

get get gga gag gag cag ctg cgc ccg ttg ctg gec gac tgc ggt gat 7392 
Ala Ala Gly Glu Glu Gin Leu Arg Pro Leu Leu Ala Asp Cys Gly Asp 
2450 2455 2460 

cgt gtg ggg ate gec gcg gtc aac get ccc ggg teg gtg gtg etc tec 7440 
Arg val Gly lie Ala Ala val Asn Ala Pro Gly Ser Val val Leu Ser 
2465 2470 2475 2480 

ggt gat egg gat gtg etc gat gac att gec ggt egg ctg gac ggg caa 7488 
Gly Asp Arg Asp Val Leu Asp Asp lie Ala Gly Arg Leu Asp Gly Gin 
2485 2490 2495 

ggg ate egg tec agg tgg ttg egg gtt teg cat gcg ttt cat teg cat 7536 
Gly lie Arg ser Arg Trp Leu Arg val Ser His Ala Phe His Ser His 
2500 2505 2510 

egg atg gat ccg atg ctg gcg gag ttc ace gaa ate gec egg age gtg 7584 
Arg Met Asp Pro Met Leu Ala Glu Phe Thr Glu lie Ala Arg Ser Val 
2515 2520 2525 

gac tac egg teg tea ggg ctg ccg ate gtg teg acg ttg acg ggt gag 7632 
Asp Tyr Arg Ser Ser Gly Leu Pro lie Val Ser Thr Leu Thr Gly Glu 
2530 2535 2540 

etc gat gag gtc ggc atg ccg get acg ccg gag tat tgg gtg cgc cag 7680 
Leu Asp Glu val Gly Met Pro Ala Thr Pro Glu Tyr Trp val Arg Gin 
2545 2550 2555 2560 

gtg cga gaa ccc gtc cgc ttc gec gac ggt gtt get gcg etc gcg get 7728 
Val Arg Glu Pro Val Arg Phe Ala Asp Gly Val Ala Ala Leu Ala Ala 
2565 ~ 2570 2575 

cac ggt gtg age acc gtc gtc gag gtc ggt ccg gat ggg gtg ttg teg 7776 
His Gly val Ser Thr val val Glu Val Gly Pro Asp Gly val Leu Ser 
2580 2585 2590 

gcg ctg gtg cag gag tgc gcg gee gga tec gat cag ggc gga egg gtg 7824 
Ala Leu val Gin Glu Cys Ala Ala Gly Ser Asp Gin Gly Gly Arg Val 
2595 2600 2605 

gec gcg gtt ccg etc atg cgc age aat cgc gac gag gcg cac acg gtg 7872 
Ala Ala val Pro Leu Met Arg Ser Asn Arg Asp Glu Ala His Thr val 
2610 2615 ~ 2620 

aca acg gca ttg gcg cag ate cat gtg cgt ggt get gag gtg gac tgg 7920 
Thr Thr Ala Leu Ala Gin lie His Val Arg Gly Ala Glu val Asp Trp 
2625 2630 2635 2640 
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egg teg ttt ttc gee gqt ace gqg gca aag cag gtc gag ctg ccc acg 7968 

Arg Ser Phe Phe Ala Gly Thr Gly Ala Lys Gin val Glu Leu Pro Thr 
2645 2650 2655 

tat gec ttc caa cga cag egg tac tgg ctt gac tea cca tec gaa ccg 8016 

Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Asp Ser Pro Ser Glu Pro 
2660 2665 2670 

gtc gqg caa tec gee gat ccc gcg cgc cag teg gqc ttc tgg gaa etc 8064 

val Gly Gin Ser Ala Asp Pro Ala Arg Gin Ser Gly Phe Trp Glu Leu 
2675 2680 ~ 2685 

gtc gag cag gaa gat gtc age gcg etc age gee get ctg cac att acc 8112 

val Glu Gin Glu Asp Val Ser Ala Leu Ser Ala Ala Leu His lie Thr 
2690 2695 2700 

gqc gat cac gac gtq cag gcg tec ctg gaa teg gtq gtt ccg gtc etc 8160 

Gly Asp His Asp val Gin Ala Ser Leu Glu Ser Val Val Pro val Leu 
2705 2710 2715 2720 

tec tec tgg cat cgc egg ate cgc aac gaa tec ctg gtq cac cag tgg 8208 

Ser Ser Trp His Arg Arg lie Arg Asn Glu Ser Leu Val His Gin Trp 
2725 ~ 2730 2735 

egg tac egg att tec tgg cat gag egg gca gat ttg cca gac ccc teg 8256 

Arg Tyr Arg lie Ser Trp His Glu Arg Ala Asp Leu Pro Asp Pro Ser 
2740 2745 2750 

ttg teg gqg aca tgg etc gtc gtc gtq ccg gag gqg tgg teg gcg agt 8304 

Leu Ser Gly Thr Trp Leu Val Val Val Pro Glu Gly Trp Ser Ala Ser 
2755 2760 2765 

egg caa gtt ctg cgt ttc aac gag atg ttc gag gaa egg gqt tgc ccg 8352 

Arg Gin val Leu Arg Phe Asn Glu Met Phe Glu Glu Arg Gly Cys Pro 
2770 ~ 2775 2780 

gca gtt ctg ttc gag etc gee gqg cac gac gag gaa gee ctg gcg caa 8400 

Ala val Leu Phe Glu Leu Ala Gly His Asp Glu Glu Ala Leu Ala Gin 
2785 2790 2795 2800 

cga ttc cgc teg ttg cct gtt gcg tea gqg gqa ata age gqc gtq ttg 8448 

Arg Phe Arg Ser Leu Pro Val Ala Ser Gly Gly lie Ser Gly Val Leu 
2805 2810 2815 

tec ttg ctg gcg ctg gat gaa teg ccg tec teg ccg aac get get ttg 8496 

Ser Leu Leu Ala Leu Asp Glu Ser Pro Ser Ser Pro Asn Ala Ala Leu 
2820 2825 2830 

ccg aat gqc gcg ctg aac teg ttg gta ctg ctg cga get ctg egg gee 8544 

Pro Asn Gly Ala Leu Asn Ser Leu Val Leu Leu Arg Ala Leu Arg Ala 
2835 2840 2845 

gcg gat gtq teg gcg cca ttg tgg ttg gcg acg tgt gqt gqt gtc gcg 8592 

Ala Asp val ser Ala Pro Leu Trp Leu Ala Thr cys Gly Gly Val Ala 
2850 2855 2860 

gtc gqg gat gtq ccg gtq aac ccg gqg cag gcg ctg gtq tgg gqa ctg 8640 

val Gly Asp val Pro Val Asn Pro Gly Gin Ala Leu val Trp Gly Leu 
2865 2870 2875 2880 

gqt cgc gtc gtc gqt ctg gag cat ccg gec tgg tgg gqt gqc ctg gtc 8688 

Gly Arg Val Val Gly Leu Glu His Pro Ala Trp Trp Gly Gly Leu Val 
2885 2890 2895 
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gac gtq ccg tgc ttg etc gat gag gac get cga gaa cgc ttg teg gtc 8736 

Asp Val Pro Cys Leu Leu Asp Glu Asp Ala Arg Glu Arg Leu Ser Val 
2900 2905 ~ 2910 

gtq ttg gca gqt ctt gqc gag gac gag ate gcg gta cgt ccc gqt ggt 8784 

val Leu Ala Gly Leu Gly Glu Asp Glu lie Ala Val Arg Pro Gly Gly 
2915 2920 2925 

gtq ttc gtq egg egg ttg gaa cgc get gqt gcg gcg teg gqt gee gqg 8832 

Val Phe val Arg Arg Leu Glu Arg Ala Gly Ala Ala Ser Gly Ala Gly 
2930 2935 2940 



teg gtq tgg cgt cct egg gqg acg gtq ttg gtq acg gqt gqt acg gqc 

Ser val Trp Arg Pro Arg Gly Thr val Leu Val Thr Gly Gly Thr Gly 

2945 2950 2955 2960 

gqt ttg gqg gcg cat gtt gee egg tgg ttg gcg gqt gee gqg get gag 

Gly Leu Gly Ala His Val Ala Arg Trp Leu Ala Gly Ala Gly Ala Glu 

2965 2970 2975 

cat gtq gtq ttg ace age cgt cga gqc gcg gcg get ccg gqc get gqa 

His val val Leu Thr ser Arg Arg Gly Ala Ala Ala Pro Gly Ala Gly 

2980 2985 2990 



gtq tgg gqt get gqc gqg cag gqc get tat gcg gcg gcg aat gee tac 
val Trp Gly Ala Gly Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Tyr 
3090 3095 3100 



gcg gtt gcg tgg gqc ccg tgg gee gqt gac gqa atg gee gca gqt gaa 
Ala val Ala Trp Gly Pro Trp Ala Gly Asp Gly Met Ala Ala Gly Glu 
3125 3130 3135 

gqc gqt gca cag ctg cgc egg gec gqc ctg gtq cca atg get gcg gat 
Gly Gly Ala Gin Leu Arg Arg Ala Gly Leu val Pro Met Ala Ala Asp 
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8880 



8928 



8976 



gat ttg egg gcg gag ctg gag gcg ctg gqc get egg gtt teg ate acg 9024 

Asp Leu Arg Ala Glu Leu Glu Ala Leu Gly Ala Arg val ser lie Thr 
2995 3000 3005 

gee tgc gac gtq gee gat cgt gac get ttg gee gaa gtq ttg gcg ace 9072 

Ala cys Asp val Ala Asp Arg Asp Ala Leu Ala Glu Vat Leu Ala Thr 
3010 3015 3020 

att ccg gat gat tgc ccg ctg acc gcg gtq atg cat gcg gcg gqg gtc 

lie Pro Asp Asp Cys Pro Leu Thr Ala Val Met His Ala Ala Gly Val 
3025 3030 3035 3040 



9120 



gtt gaa gtc gqc gac gtq gcg teg atg tgt ttg acc gac ttc gtt gqg 9168 

val Glu val Gly Asp val Ala Ser Met Cys Leu Thr Asp Phe val Gly 
3045 3050 3055 

gtq ctg teg gcg aag gca gqt gqt gcg gcg aat etc gat gag ttg etc 9216 

val Leu Ser Ala Lys Ala Gly Gly Ala Ala Asn Leu Asp Glu Leu Leu 
3060 3065 3070 

gee gat gtc gag ctg gat gee ttc gtq ctg ttc tea tec gtc teg gqt 9264 

Ala Asp val Glu Leu Asp Ala Phe Val Leu Phe Ser Ser Val Ser Gly 
3075 3080 3085 



9312 



ttg gat gcg ttg gcg cag cag cgt egg gca agg gqg ttg gtq gqg act 9360 
Leu Asp Ala Leu Ala Gin Gin Arg Arg Ala Arg Gly Leu Val Gly Thr 
3105 3110 3115 3120 



9408 



9456 
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3140 3145 3150 

egg gcg ttg ctg gca ctt cag ggc gca ttg gat cgt gac gag aca tec 9504 

Arg Ala Leu Leu Ala Leu Gin Gly Ala Leu Asp Arg Asp Glu Thr ser 
3155 3160 3165 

ctg gtc gtg gec gat atg gcg tgg gag agg ttc gec ccg gtg ttc gec 9552 

Leu val val Ala Asp Met Ala Trp Glu Arg Phe Ala Pro Val Phe Ala 
3170 3175 "* 3180 

atg tec cgt egg cgt ccg ctg etc gac gag ctg ccc gaa gca cag cag 9600 

Met Ser Arg Arg Arg Pro Leu Leu Asp Glu Leu Pro Glu Ala Gin Gin 
3185 3190 3195 3200 

gcg ttg gcg gat gcg gag aac ace act gat get gcg gac teg gec gtc 9648 

Ala Leu Ala Asp Ala Glu Asn Thr Thr Asp Ala Ala Asp Ser Ala Val 
3205 3210 3215 

ccg eta ccg egg etc gcg ggc atg gca gee gec gaa cgc cgc cgc gcg 9696 

Pro Leu Pro Arg Leu Ala Gly Met Ala Ala Ala Glu Arg Arg Arg Ala 
3220 3225 3230 

atg ctg gac ctg gtq ctg gcg gag gee teg att gtq ttg gga cac aac 9744 

Met Leu Asp Leu val Leu Ala Glu Ala Ser lie Val Leu Gly His Asn 
3235 3240 3245 

ggg tct gac cca gtt ggt ccc gac egg gcg ttc cag gag etc gqa ttt 9792 

Gly Ser Asp Pro val Gly Pro Asp Arg Ala Phe Gin Glu Leu Gly Phe 
3250 3255 3260 

gat teg ctg atg gee gtc gaa ctg cgc aac agg ttg gqc gag gca aca 9840 

Asp Ser Leu Met Ala val Glu Leu Arg Asn Arg Leu Gly Glu Ala Thr 
3265 3270 3275 3280 

gqa ttg agt ctg ccg gee acg ttg ate ttc gat tat ccg age cca tec 9888 

Gly Leu Ser Leu Pro Ala Thr Leu lie Phe Asp Tyr Pro Ser Pro Ser 
3285 3290 3295 

gcg ctg get gag cag ctg gtc gqc gag ctg gtq gqa gcg cag ccc gcg 9936 

Ala Leu Ala Glu Gin Leu val Gly Glu Leu val Gly Ala Gin Pro Ala 
3300 3305 3310 

ace ace gtc gtq gee ggg gee gat cca gtg gat gat ccg gtt gtc gtg 9984 

Thr Thr val Val Ala Gly Ala Asp Pro Val Asp Asp Pro val val val 
3315 3320 3325 

gtc gcg atg gqa tgc egg tat ccg ggc gac gtc tgc teg ccc gag gag 10032 

Val Ala Met Gly Cys Arg Tyr Pro Gly Asp Val Cys Ser Pro Glu Glu 
3330 3335 3340 

ctg tgg cag ctg gtt tct gcg gga cgt gat gcg gta teg acg ttc ccc 10080 

Leu Trp Gin Leu Val Ser Ala Gly Arg Asp Ala Val Ser Thr Phe Pro 
3345 3350 3355 3360 

gtc gat egg ggt tgg gac tgc aac acg ttg ttc gac ccg gat ccg gat 10128 

Val Asp Arg Gly Trp Asp Cys Asn Thr Leu Phe Asp Pro Asp Pro Asp 
3365 3370 3375 

egg gca ggc agt ace tat gtg cga gaa ggt gec ttc ctg acc ggt get 10176 

Arg Ala Gly ser Thr Tyr val Arg Glu Gly Ala Phe Leu Thr Gly Ala 
3380 3385 3390 

gat egg ttc gac gee ggg ttc ttc ggc ate age cct cgc gag gcg cgc 10224 
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Asp Arg Phe Asp Ala Gly Phe Phe Gly lie Ser Pro Arg Glu Ala Arg 
3395 3400 3405 

gca atg gat ccg cag cag agg ttg ttg etc gaa gtq gcg tgg gag gtt 10272 

Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu VaT Ala Trp Glu Val 
3410 3415 3420 

ttc gaa cga gca gqa ate get ccg ctg teg ttg egg gqt age agg acc 10320 

Phe Glu Arg Ala Gly lie Ala Pro Leu Ser Leu Arg Gly Ser Arg Thr 

3425 3430 3435 3440 

gqt gtq ttc gcg gqg acc aat gqg cag gac cac gqt gcg aaa gtq get 10368 

Gly VaT Phe Ala Gly Thr Asn Gly Gin Asp His Gly Ala Lys val Ala 
3445 3450 3455 

gee gcg ccg gag gcg gcg gqt cac etc ctg acc gqa aac gee gcg agt 10416 

Ala Ala Pro Glu Ala Ala Gly His Leu Leu Thr Gly Asn Ala Ala Ser 

3460 3465 3470 

gtc ctg gec gqc egg ctt tec tac acg ttc gqc ctt gag gqg cct gcg 10464 

val Leu Ala Gly Arg Leu Ser Tyr Thr Phe Gly Leu Glu Gly Pro Ala 
3475 3480 3485 

gtq gcg gtq gat acc gcg tgt teg teg teg ttg gtq gcg ttg cat ttg 10512 

val Ala val Asp Thr Ala Cys Ser Ser Ser Leu VaT Ala Leu His Leu 
3490 3495 3500 

gcg tgc cag teg ctg cgt teg gqt gag tgt gat atg gcg ttg gca gqt 10560 

Ala Cys Gin Ser Leu Arg Ser GTy Glu Cys Asp Met Ala Leu Ala GTy 

3505 3510 3515 3520 

ggt gtq acg gtq atg teg aca ccc ctg get ttc etc gag ttc tct cgt 10608 

GTy VaT Thr VaT Met Ser Thr Pro Leu Ala Phe Leu Glu Phe Ser Arg 
3525 3530 3535 

cag cgc gqt ttg gcg cca gat gqt egg tgc aag teg ttt gcg gee get 10656 

Gin Arg GTy Leu Ala Pro Asp GTy Arg Cys Lys Ser Phe Ala Ala Ala 

3540 3545 3550 

gcg gat gqc acc gqg tgg gqt gag gqt gee gqc ctg gtq ttg ctg gag 10704 

Ala Asp GTy Thr GTy Trp GTy Glu GTy Ala GTy Leu VaT Leu Leu Glu 
3555 3560 3565 

egg ttg teg gat get cgt egg aat gqt cac egg gtq ttg gec gtq gtt 10752 

Arg Leu Ser Asp Ala Arg Arg Asn GTy His Arg VaT Leu Ala VaT val 
3570 3575 3580 

cgc gqg tct gcg gtq aat cag gat gqt gcg teg aat gqc ctg act gcg 10800 

Arg GTy Ser Ala VaT Asn Gin Asp GTy Ala Ser Asn GTy Leu Thr Ala 

3585 3590 3595 3600 

ccg aat gqt ccg teg cag cag egg gtq att egg cag gec etc gcg aat 10848 

Pro Asn GTy pro Ser Gin Gin Arg VaT lie Arg Gin Ala Leu Ala Asn 
3605 3610 ~ 3615 

gcg ggg ctg teg gcg tec gat gtq gat gtc gtq gag gcg cac gqg acc 10896 

Ala GTy Leu ser Ala Ser Asp vaT Asp val VaT Glu Ala His GTy Thr 

3620 3625 3630 

gqt acc ggg etc gqg gat ccg ate gag gcg cag gcg ctg ate gcg aca 10944 

GTy Thr GTy Leu GTy Asp Pro lie Glu Ala Gin Ala Leu lie Ala Thr 
3635 3640 3645 
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tat gqg cag gag egg gat cct gag egg gec ctg tgg ctg gqg teg ate 

Tyr Gly Gin Glu Arg Asp Pro Glu Arg Ala Leu Trp Leu Gly Ser lie 
3650 3655 3660 

aag tec aac ate gqc cac acg cag gcg gcg gec gqt gtq gcg gqg gtc 

Lys Ser Asn lie Gly His Thr Gin Ala Ala Ala Gly val Ala Gly Val 

3665 3670 3675 3680 



cac gtq gac aag ccc act cca cag gtq gac tgg tct gec gqg gec gtt 
His val Asp Lys Pro Thr Pro Gin Val Asp Trp Ser Ala Gly Ala val 
3700 3705 3710 

egg etc etc acc gqg aac acg ccc tgg ccc gag age gqc cgt cct cgt 
Arg Leu Leu Thr Gly Asn Thr Pro Trp Pro Glu Ser Gly Arg Pro Arg 
3715 3720 3725 



gcg acc act cgt tct gtq ctg gac gaa cgc gec gtc gtq tgg gqt gee 
Ala Thr Thr Arg Ser Val Leu Asp Glu Arg Ala val Val Trp Gly Ala 
3810 3815 3820 



gqa ttc gtt ttt tec gqt cag gqc agt cag tgg ctg gqg atg gqa aag 
Gly Phe Val Phe Ser Gly Gin Gly Ser Gin Trp Leu Gly Met Gly Lys 



10992 



11040 



ate aag atg gtq cag gec atg egg cac gqg gag ttg cct gcg acg ttg 11088 
lie Lys Met val Gin Ala Met Arg His Gly Glu Leu Pro Ala Thr Leu 
3685 "* 3690 3695 



11136 



11184 



cga gcg gqg gtq teg teg ttc gqg ate age gqc acc aac gca cac etc 11232 
Arg Ala Gly Val Ser Ser Phe Gly lie Ser Gly Thr Asn Ala His Leu 
3730 3735 3740 

ate etc gaa caa cca ccg teg gaa cca gcg gag ate gac caa teg gat 11280 
lie Leu Glu Gin Pro Pro Ser Glu Pro Ala Glu lie Asp Gin ser Asp 
3745 3750 3755 3760 

egg egg gtc act gcg cat cca gcg gtq ate ccg tgg atg ttg teg get 11328 
Arg Arg val Thr Ala His Pro Ala val lie Pro Trp Met Leu Ser Ala 
3765 3770 3775 

agg agt etc gca gcg ctg cag gee caa gcg get gcg ctg cag gee egg 11376 
Arg Ser Leu Ala Ala Leu Gin Ala Gin Ala Ala Ala Leu Gin Ala Arg 
3780 3785 3790 

ctg gac egg gqt cct gqc get tct ccg ctg gat ttg gqg tat tea etc 11424 
Leu Asp Arg Gly Pro Gly Ala Ser Pro Leu Asp Leu Gly Tyr Ser Leu 
3795 3800 3805 



11472 



gat egg gag gca ctg ctg tec agg ctg gca gcg etc gee gat gqc egg 11520 

Asp Arg Glu Ala Leu Leu Ser Arg Leu Ala Ala Leu Ala Asp Gly Arg 

3825 3830 ~ 3835 3840 

acg gcg ccg gqg gtq ata acg gqc tct gcg aat tec gqt gqc cgc ate 11568 

Thr Ala Pro Gly Val lie Thr Gly Ser Ala Asn Ser Gly Gly Arg lie 

3845 3850 * 3855 



11616 



gcg ttg tgc gcg get ttc ccg gcg ttc gcg gac gec ttc gag gaa gec 11664 
Ala Leu Cys Ala Ala Phe Pro Ala Phe Ala Asp Ala Phe Glu Glu Ala 
3875 3880 3885 

tgc gac gcg eta age gca cac ctg gqc gcg gac gtt egg gqt gtq ctg 11712 
Cys Asp Ala Leu Ser Ala His Leu Gly Ala Asp Val Arg Gly Val Leu 
3890 3895 3900 
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ttc gqt get gat gag cag atg etc gac egg acg ctg tgg gcg cag teg 11760 

Phe Gly Ala Asp Glu Gin Met Leu Asp Arg Thr Leu Trp Ala Gin Ser 
3905 3910 3915 3920 

gqg ate ttc gcg gtt caa gtc gqc etc ctg gqa ttg ctg agg teg tgg 11808 

Gly lie Phe Ala val Gin Val Gly Leu Leu Gly Leu Leu Arg Ser Trp 
3925 ' 3930 3935 

gqc gtq egg ccg gee gcg gtq ctg gqg cac teg gtc gqc gag ttg get 11856 

Gly val Arg Pro Ala Ala val Leu Gly His Ser val Gly Glu Leu Ala 
3940 3945 3950 

gcg gcg cac gcg get gqt gtq ttg tec ttg ccg gac get gca egg ttg 11904 

Ala Ala His Ala Ala Gly Val Leu Ser Leu Pro Asp Ala Ala Arg Leu 
3955 3960 3965 

gtt gcg get egg gee cac ctg atg cag gca ttg ccc acc gqc gqc gca 11952 

Val Ala Ala Arg Ala His Leu Met Gin Ala Leu Pro Thr Gly Gly Ala 
3970 3975 3980 

atg etc gcg gtc gee acc age gag gcg gcg gtc gqa ccg ctg ctt tec 12000 

Met Leu Ala Val Ala Thr Ser Glu Ala Ala Val Gly Pro Leu Leu Ser 
3985 3990 3995 4000 

gqg gtq tgc gat egg gtc age ate get gcg ate aac gqc ccc gag teg 12048 

Gly val cys Asp Arg val Ser lie Ala Ala lie Asn Gly Pro Glu Ser 
4005 4010 4015 

gta gtq etc tec gqc gac cgc gat gtq etc gtq gag etc gca gqc gaa 12096 

val val Leu Ser Gly Asp Arg Asp Val Leu Val Glu Leu Ala Gly Glu 
4020 4025 4030 

ttc gat gee cga gqg ctt agg acc aaa tgg ttg egg gtc tec cat get 12144 

Phe Asp Ala Arg Gly Leu Arg Thr Lys Trp Leu Arg val Ser His Ala 
4035 4040 4045 

ttc cac teg cac egg atg gaa ccg att ctg gac gag tac gcg gaa acc 12192 

Phe His Ser His Arg Met Glu Pro lie Leu Asp Glu Tyr Ala Glu Thr 
4050 4055 4060 

gec agg tgc gtc gag ttc gqt gaa ccg gtq gtq ccg ate gtc tec gee 12240 

Ala Arg Cys Val Glu Phe Gly Glu Pro Val val Pro lie Val Ser Ala 
4065 4070 4075 4080 

gcg acc gqt gcg ctg gac acc acc gqa ctg atg tgc gcg gee gac tac 12288 

Ala Thr Gly Ala Leu Asp Thr Thr Gly Leu Met Cys Ala Ala Asp Tyr 
4085 4090 4095 

tgg acg cgc caa gtq cgt gat cct gtc cgc ttc gqa gac gqt gtc egg 12336 

Trp Thr Arg Gin val Arg Asp Pro val Arg Phe Gly Asp Gly Val Arg 
4100 "* 4105 4110 

gcg etc gtc gqc caa gqc gtq gac acg ate gtc gag ttc gqc ccg gac 12384 

Ala Leu val Gly Gin Gly val Asp Thr lie Val Glu Phe Gly Pro Asp 
4115 4120 4125 

gqg gcg ttg teg gec ctg gtc gag cag tgc ttg gec gqg tec gac cag 12432 

Gly Ala Leu Ser Ala Leu Val Glu Gin Cys Leu Ala Gly Ser Asp Gin 
4130 4135 4140 

get gqg agg gtq gcg gcg ate ccg ctg atg cgc agg gac cgc gat gag 12480 

Ala Gly Arg val Ala Ala lie Pro Leu Met Arg Arg Asp Arg Asp Glu 
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4145 4150 4155 4160 

gtc gag acc gcg gtq gcg gcc ctg gcg cac gtq cac gtc cgc gqt gqt 12528 
val Glu Thr Ala val Ala Ala Leu Ala His val His Val Arg Gly Gly 
4165 4170 4175 

gcg gtq gac tgg teg get tgc ttc gcc gqc acc gqc gcc cgc acc gtc 12576 
Ala val Asp Trp Ser Ala Cys Phe Ala Gly Thr Gly Ala Arg Thr Val 
4180 4185 4190 

gag ttg ccc acc tac gcc ttc caa cgc cag egg tac tgg ctg gcc gqg 12624 
Glu Leu Pro Thr Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Ala Gly 
4195 4200 4205 



caa gcg gac gqg cgc gqc gqc gat gtq gtt gcc gac ccg gtc gac gcg 
Gin Ala Asp Gly Arg Gly Gly Asp val val Ala Asp Pro val Asp Ala 
4210 " 4215 4220 



tgg cgc tac cag gtq cgc tgg agg tec gtc gag gtq ccg tec gca gcc 

Trp Arg Tyr Gin val Arg Trp Arg Ser val Glu val Pro ser Ala Ala 
4275 4280 4285 

gcc etc egg gqc gtq tgg ctg gtq gtq ctt cca get gac gtq ccc cga 

Ala Leu Arg Gly Val Trp Leu Val Val Leu Pro Ala Asp val Pro Arg 
4290 4295 4300 



gtq gac aag gtq cgc gcc gtc att gcg gac cgc acc gag gtq acg gqt 

val Asp Lys val Arg Ala val lie Ala Asp Arg Thr Glu val Thr Gly 
4340 ~ 4345 ~ 4350 

gtq ttg tct ctg ttg gcg atg gac gqc atg ccc tgc gcg gcg cat ccg 

val Leu ser Leu Leu Ala Met Asp Gly Met Pro Cys Ala Ala His Pro 
4355 4360 4365 



12672 



cgc ttc tgg gag ttg gtc gag cgc gcc gat ccg gaa ccg ttg gtq gat 12720 

Arg Phe Trp Glu Leu val Glu Arg Ala Asp Pro Glu Pro Leu val Asp 
4225 4230 4235 4240 

gaa etc tgc ate gac egg gac cag ccc ttc egg gag gtq ctg ccc gtt 12768 

Glu Leu Cys lie Asp Arg Asp Gin Pro Phe Arg Glu Val Leu Pro val 
4245 " 4250 " 4255 

ctg get tec tgg cgc gag aaa caa cgc cag gag gcc etc gcg gat tec 12816 

Leu Ala Ser Trp Arg Glu Lys Gin Arg Gin Glu Ala Leu Ala Asp Ser 
4260 4265 4270 



12864 



12912 



gat caa ccg gcg gtc gtc ate gac gcg ctg ate gcg cgc gqc gcc gag 12960 

Asp Gin Pro Ala Val Val lie Asp Ala Leu lie Ala Arg Gly Ala Glu 
4305 4310 4315 4320 

gtc gcg gtc ctg gaa ttg acc gag cag gac etc caa cgc agt gcg ctt 13008 

val Ala val Leu Glu Leu Thr Glu Gin Asp Leu Gin Arg ser Ala Leu 
4325 4330 4335 



13056 



13104 



cac ctg tec cgt gqt gtc gcc get acc gtq ate ctg acg cag gtq ttg 13152 

His Leu Ser Arg Gly Val Ala Ala Thr Val lie Leu Thr Gin Val Leu 
4370 4375 4380 

gqc gat gcg gqt gtt tec gcc ccg ctg tgg ctg gcc acg acc gqt gqc 13200 

Gly Asp Ala Gly val Ser Ala Pro Leu Trp Leu Ala Thr Thr Gly Gly 

4385 4390 4395 4400 

gtc gag gcc ggg acc gag gac ggt ccg gcc gat ccg gac cac ggc ttg 13248 
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val Glu Ala Gly Thr Glu Asp Gly Pro Ala Asp Pro Asp His Gly Leu 
4405 4410 4415 

ate tgg gqg etc gqc agg gtc gtc gqc ctt gaa cat ccg cag tgg tgg 13296 

lie Trp Gly Leu Gly Arg val val Gly Leu Glu His Pro Gin Trp Trp 

4420 4425 4430 

gqt gqc ctg ate gac ctt ccg gag aca ctg gac gag acg tec egg aac 13344 

Gly Gly Leu lie Asp Leu Pro Glu Thr Leu Asp Glu Thr Ser Arg Asn 
4435 4440 4445 

gqg ttg gtq gee gca etc gee gqg acg gcg gee gaa gat cag etc gec 13392 

Gly Leu val Ala Ala Leu Ala Gly Thr Ala Ala Glu Asp Gin Leu Ala 
4450 4455 4460 

gtq cgt tea tec gqg ttg ttc gtt cgc aga gtq gtq cgc gca gcg egg 13440 

val Arg ser Ser Gly Leu Phe val Arg Arg Val Val Arg Ala Ala Arg 

4465 4470 4475 " 4480 

aac ccc egg tea gag aca tgg cgt age egg gqa acg gtc etc ate acg 13488 

Asn Pro Arg Ser Glu Thr Trp Arg Ser Arg Gly Thr val Leu lie Thr 
4485 ~ 4490 4495 

gqc gqa aca gqc gcg etc gqt gee gag gtc gca cga tgg ctg gee egg 13536 

Gly Gly Thr Gly Ala Leu Gly Ala Glu val Ala Arg Trp Leu Ala Arg 

4500 4505 4510 

egg gqa get gag cac ctg gtq ttg ate agt cgc cgc gqc ccg gaa get 13584 

Arg Gly Ala Glu His Leu val Leu lie Ser Arg Arg Gly Pro Glu Ala 
4515 4520 ~ 4525 

ccc gqc gca gcg gac eta gqg gee gag ctg act gaa etc gqc gtq aaa 13632 

Pro Gly Ala Ala Asp Leu Gly Ala Glu Leu Thr Glu Leu Gly val Lys 
4530 4535 4540 

gtc aca gtc ttg gec tgc gat gtq acg gac cgc gac gag ctg gcg gcg 13680 

val Thr Val Leu Ala Cys Asp Val Thr Asp Arg Asp Glu Leu Ala Ala 
4545 4550 4555 4560 

gtq ctg gcg gec gtt ccc acg gag tat ccg ctg teg gcg gtc gtq cac 13728 

Val Leu Ala Ala Val Pro Thr Glu Tyr Pro Leu Ser Ala Val Val His 
4565 4570 4575 

acc gec gqc gtc gqg acg cct gcg aac ctg gec gag acg ace ttg gcg 13776 

Thr Ala Gly val Gly Thr Pro Ala Asn Leu Ala Glu Thr Thr Leu Ala 

4580 4585 4590 

cag ttc gec gac gtq ttg teg gec aag gtc gtc gqc gcg gcg aac ctg 13824 

Gin Phe Ala Asp Val Leu Ser Ala Lys Val Val Gly Ala Ala Asn Leu 
4595 4600 4605 

gac egg ctg ctt gqc gqg caa ccg ttg gac gec ttc gtq ctg ttc tec 13872 

Asp Arg Leu Leu Gly Gly Gin Pro Leu Asp Ala Phe Val Leu Phe Ser 
4610 4615 4620 

teg ate teg gqa gtt tgg gqa gee gqc gqc caa gqa gec tat teg gec 13920 

Ser lie Ser Gly val Trp Gly Ala Gly Gly Gin Gly Ala Tyr Ser Ala 
4625 4630 4635 4640 

gec aat gcg tat etc gat gee ctt gec gag cgc cga egg get tgc gqg 13968 

Ala Asn Ala Tyr Leu Asp Ala Leu Ala Glu Arg Arg Arg Ala Cys Gly 
4645 4650 ~ 4655 
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egg ccg gcg acg tgc ate gec tgg ggt ccg tgg gcg ggt gcg ggc atg 14016 

Arg Pro Ala Thr Cys lie Ala Trp Gly Pro Trp Ala Gly Ala Gly Met 

4660 4665 4670 

gec gtt cag gaa gqt aac gag gcg cat etc cgc cga agg ggc ctg gta 14064 

Ala val Gin Glu Gly Asn Glu Ala His Leu Arg Arg Arg Gly Leu val 
4675 4680 4685 

ccg atg gaa ccg cag teg gec etc ttc gcg ctg caa cag gee ctg tec 14112 

Pro Met Glu Pro Gin Ser Ala Leu Phe Ala Leu Gin Gin Ala Leu Ser 
4690 4695 4700 

caa cga gaa acc gec ate ace gtc gca gat gtg gac tgg gag cga ttc 14160 

Gin Arg Glu Thr Ala lie Thr val Ala Asp val Asp Trp Glu Arg Phe 

4705 4710 4715 4720 

gec gec tct ttc acc gcg gee cgc ccg cga cca ctg ttg gaa gag ate 14208 

Ala Ala Ser Phe Thr Ala Ala Arg Pro Arg Pro Leu Leu Glu Glu lie 

4725 ~ 4730 4735 

gtq gat eta egg ccc gac acc gag acc gag gag aag cac gqt gec ggc 14256 

val Asp Leu Arg Pro Asp Thr Glu Thr Glu Glu Lys His Gly Ala Gly 

4740 4745 4750 

gag ctg ggg cag cag ctg gec gca ctg ccg ccc get gag cgc gga cac 14304 

Glu Leu Gly Gin Gin Leu Ala Ala Leu Pro Pro Ala Glu Arg Gly His 
4755 4760 4765 

ctg ctg ctg gag gtg gtg ctg gcg gaa acc gec age acc ctg ggg cac 14352 

Leu Leu Leu Glu Val val Leu Ala Glu Thr Ala Ser Thr Leu Gly His 
4770 4775 4780 

gat teg gcg gag get gtq caa ccc gat egg acc ttc gec gaa ctg ggc 14400 

Asp Ser Ala Glu Ala Val Gin Pro Asp Arg Thr Phe Ala Glu Leu Gly 

4785 4790 4795 4800 

ttc gat teg ctg acc gcg gta gag ctg cgc aac agg ttg aac gcg gtg 14448 

Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Asn Ala Val 

4805 4810 ~ 4815 

acc gqg ctt cgc ctg ccg ccg acg ctg gtt ttc gac cac ccg acg ccg 14496 

Thr Gly Leu Arg Leu Pro Pro Thr Leu val Phe Asp His Pro Thr Pro 

4820 4825 4830 

ctg gcg ttg tec gaa cag ttg gtt ccg gec ctg gtc gcg gag ccg gac 14544 

Leu Ala Leu Ser Glu Gin Leu val Pro Ala Leu val Ala Glu Pro Asp 
4835 4840 4845 

aac ggc ate gaa teg ctg etc gee gag etc gac agg ctg gat acc acg 14592 

Asn Gly lie Glu Ser Leu Leu Ala Glu Leu Asp Arg Leu Asp Thr Thr 
4850 4855 4860 

ttg gcg caa ggg cct teg ate cca ctg gaa gac cag gec aag gtg gcg 14640 

Leu Ala Gin Gly Pro Ser lie Pro Leu Glu Asp Gin Ala Lys Val Ala 

4865 4870 4875 * 4880 

gag cgc ttg cac gca etc etc gee aag tgg gac ggg gcg cgt gac ggc 14688 

Glu Arg Leu His Ala Leu Leu Ala Lys Trp Asp Gly Ala Arg Asp Gly 

4885 4890 4895 

acg gec aga gcg acg tea ccc caa teg ctg acg gcg gec acg gac gac 14736 

Thr Ala Arg Ala Thr Ser Pro Gin Ser Leu Thr Ala Ala Thr Asp Asp 

4900 4905 4910 
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gaa ate ttc gac etc ate gac egg aag ttc egg cgc tga 14775 
Glu lie Phe Asp Leu lie Asp Arg Lys Phe Arg Arg 
4915 4920 

<210> 48 
<211> 4924 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 48 

Met Ala Asn Glu Glu Lys Leu Phe Gly Tyr Leu Lys Lys val Thr Ala 
1 5 10 15 

Asp Leu His Gin Thr Arg Gin Arg Leu Leu Ala Ala Glu Ser Arg ser 
20 ~ 25 30 

Gin Glu Pro lie Ala lie val Ser Ala Ser cys Arg Leu Pro Gly Gly 
35 40 45 

Val Asp Ser Pro Glu Ala Leu Trp Gin Leu Val Arg Thr Gly Thr Asp 
50 55 60 

Ala lie Ser Glu Phe Pro Ala Asp Arg Gly Trp Asp Leu Gly Arg Leu 
65 70 75 80 

Tyr Asp Pro Asp Pro Asn His Gin Gly Thr Ser Tyr Thr Arg Ala Gly 
85 90 95 

Gly Phe Leu Ala Gly Ala Gly Asp Phe Asp Pro Ala Met Phe Gly lie 
100 105 110 

Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 

Glu Leu Ser Trp Glu Ala Leu Glu Arg Ala Gly lie Asp Pro Thr ser 
130 135 140 

Leu Arg Gly Ser Lys Thr Gly Val Phe Gly Gly Val Thr Pro Gin Glu 
145 150 155 160 

Tyr Gly Pro Ser Leu Gin Glu Met Ser Arg Asn Ala Gly Gly Phe Gly 
165 170 175 

Leu Thr Gly Arg Met Val Ser Val Ala Ser Gly Arg Val Ala Tyr ser 
180 185 ' " 190 

Phe Gly Phe Glu Gly Pro Ala val Thr val Asp Thr Ala Cys Ser ser 
195 200 205 

Ser Leu Val Ala Leu His Leu Ala Cys Gin Ser Leu Arg Ser Gly Glu 
210 215 220 

Cys Asp Leu Ala Leu Ala Gly Gly val Thr val Met Ala Thr Pro Ala 
225 230 235 240 

Thr Phe val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg 
245 250 255 

Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly 
260 265 ' 270 
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Ala Gly Leu val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly 
275 280 285 

His Glu val Leu Ala val val Arg Gly ser Ala val Asn Gin Asp Gly 
290 295 300 

Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin Arg val 
305 310 315 320 

lie Thr Gin Ala Leu Ala Ser Ala Gly Leu Ser val ser Asp val Asp 
325 330 335 

Ala val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro lie Glu 
340 345 350 

Ala Gin Ala Leu lie Ala Thr Tyr Gly Gin Gly Arg Glu Lys Asp Arg 
355 360 365 

Pro Leu Trp Leu Gly Ser Val Lys Ser Asn lie Gly His Thr Gin Ala 
370 375 380 

Ala Ala Gly val Ala Gly val lie Lys Met val Leu Ala Met Arg His 
385 390 395 400 

Gly Gin Leu Pro Ala Thr Leu His Val Asp Glu Pro Thr Ser Ala val 
405 410 415 

Asp Trp ser Ala Gly Ser val Arg Leu Leu Thr Glu Asn Thr Pro Trp 
420 425 430 

Pro Asp Ser Gly Arg Pro Cys Arg val Gly Val Ser Ser Phe Gly lie 
435 440 445 

Ser Gly Thr Asn Ala His val lie Leu Glu Gin Ser Pro Val Glu Gin 
450 455 460 

Gly Glu Pro Ala Gly Pro val Glu Gly Glu Arg Glu Pro Asp Val Ala 
465 470 475 480 

val Pro val val Pro Trp val Leu ser Gly Lys Thr pro Glu Ala Ala 
485 490 495 

Arg Ala Gin Ala Glu Arg Val His Ser His lie Glu Asp Arg Pro Gly 
500 505 510 

Leu Ser Pro Val Asp val Ala Tyr Ser Leu Gly Met Thr Arg Ala Ala 
515 520 525 

Leu Asp Glu Arg Ala val val Leu Gly ser Asp Arg Ala Ala Leu Leu 
530 535 540 

Thr Gly Leu Arg Ala Phe Ala Asp Gly Cys Asp Ala Pro Glu val val 
545 550 555 560 

Ser Gly Ser val Gly Leu Gly Gly Arg val Gly Phe val Phe Ser Gly 
565 570 575 

Gin Gly Gly Gin Trp Pro Gly Met Gly Arg Gly Leu Tyr Ser Val Phe 
580 585 590 

Pro val Phe Ala Asp Ala Phe Asp Glu Ala cys Ala Glu Leu Asp Ala 
595 600 605 
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His Leu Gly Gin Glu Leu Arg Val Arg Asp Val val Phe Gly Ser Gin 
610 615 ~ 620 

Ala Trp Leu Leu Asp Arg Thr Val Trp Ala Gin Ser Gly Leu Phe Ala 
625 630 635 640 

Leu Gin lie Gly Leu Leu Arg Leu Leu Gly Ser Trp Gly val Arg Pro 
645 650 655 

Asp Val val Leu Gly His Ser val Gly Glu Leu Ala Ala val His Ala 
660 665 670 

Ala Gly val Leu Ser Leu Ser Glu Ala Ala Arg Leu Val Ala Gly Arg 
675 680 685 

Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Ala Met Leu Ala Val 
690 695 700 

Ala Thr Gly Glu Phe Gin val Asp Pro Leu Leu Asp Gly Val Arg Asp 
705 710 715 720 

Arg lie Gly lie Ala Ala Val Asn Gly Pro Glu Ser Val Val Leu Ser 
725 730 735 

Gly Asp Arg Glu Leu Leu Thr Glu lie Ala Asp Arg Leu His Asp Gin 
740 745 750 

Gly Cys Arg Thr Arg Trp Leu Arg val ser His Ala Phe His Ser Pro 
755 - 760 765 

His Met Glu Pro Met Leu Glu Glu Phe Ala Gin lie Ser Arg Gly Arg 
770 775 780 

Glu Tyr His Ala Pro Glu Leu Pro lie lie Ser Thr Leu lie Gly Glu 
785 790 795 800 

Leu Asp Gly Gly Arg Val Met Gly Thr Pro Glu Tyr Trp val Arg Gin 
805 810 815 

Val Arg Glu Pro Val Arg Phe Ala Glu Gly Val Gin Ala Leu Val Gly 
820 825 830 

Gin Gly val Gly Thr lie val Glu Leu Gly Pro Asp Gly Ala Leu Ser 
835 840 845 

Thr Leu Val Glu Glu Cys Val Ala Glu Ser Gly Arg val Ala Gly lie 
850 855 860 

Pro Leu Met Arg Lys Asp Arg Asp Glu Ala Arg Thr Val Leu Ala Ala 
865 870 875 880 

Leu Ala Gin lie His Thr Arg Gly Gly Glu val Asp Trp Arg Ser Phe 
885 " 890 895 

Phe Ala Gly Thr Gly Ala Lys Gin val Asp Leu Pro Thr Tyr Ala Phe 
900 905 910 

Gin Arg Gin Arg Tyr Trp Leu Ala Ser Thr Gly Arg Ala Gly Asp Val 
915 920 925 

Thr Ala Ala Gly Leu Ala Glu Ala Asp His Pro Leu Leu Gly Ala Val 
930 935 940 
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val Ala Leu Ala Asp Gly Glu Gly val val Leu Thr Gly Arg Leu Thr 
945 950 955 960 

Ala Gly ser His Pro Trp Leu Ser Asp His Arg val Leu Gly Glu lie 
965 970 975 

val val Pro Gly Thr Ala lie val Glu Leu val Trp His Val Gly Glu 
980 985 990 

Arg Leu Gly Cys Gly Arg val Glu Glu Leu Ala Leu Glu Ala Pro Leu 
995 1000 1005 

He Leu Pro Asp His Gly Ala val Gin val Gin val Leu val Gly Pro 
1010 1015 1020 

Pro Gly Glu Ser Gly Ala Arg Ser val Ala Leu Tyr Ser Cys Pro Gly 
025 1030 " 1035 1040 

Glu Ala lie Glu Pro Glu Trp Lys Lys His Ala Thr Gly Val Leu Leu 
1045 1050 1055 

Pro Pro val Ala Ala Glu Asn His Glu Leu Thr Ala Trp Pro Pro Glu 
1060 1065 1070 

Asn Ala Thr Glu lie Asp Ala Asp Gly Val Tyr Ala Phe Leu Glu Gly 
1075 1080 1085 

His Gly Phe Ala Tyr Gly Pro Ala Phe Arg Cys Leu Arg Gly Ala Trp 
1090 1095 1100 

Arg Arg Gly Gly Glu val Phe Ala Glu val Ala Leu Pro Asp Asp Met 
105 1110 1115 1120 

Gin Ala Gly val Asp Arg Phe Gly val His Pro Ala Leu Leu Asp Ala 
1125 " 1130 1135 

val Leu His Ala Ala Ala Ala Glu Thr Ser val val Gin ser Glu Ala 
1140 1145 1150 

Arg val Pro Phe Ser Trp Arg Gly Val Glu Leu Arg Ala Thr Glu Ser 
1155 1160 1165 

Ala val val Arg Ala Arg Leu Ser Leu Thr Ser Asp Asp Glu Leu Ser 
1170 1175 1180 

Leu val Ala val Asp Pro Ala Gly Arg Phe Val Ala Thr val Asp Ser 
185 1190 1195 1200 

Leu val Thr Arg Pro lie Ser Arg Gin Gin Val Arg Ser Gly Ala lie 
1205 1210 ~ 1215 

Gly Asp Cys Leu Phe Glu val Glu Trp His Arg Lys Ala Leu Leu Gly 
1220 1225 ' 1230 

Thr Thr Ala Gly Asp Asp Leu Ala lie Val Gly Asp Gly Pro Ser Trp 
1235 1240 1245 

Pro Glu Ser val Arg Ala Thr Ala Arg Phe Ala Thr Leu Asp Glu Phe 
1250 1255 1260 

Arg Ala Ala val Asp Ser Asp val Pro Ala Pro Gly Ser val Leu Val 
265 1270 1275 1280 
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Ala Ala Met Ser Ala Glu Glu val Glu Gly Gly ser Leu pro Ser Arg 
1285 1290 1295 

Ala Gin Glu Ser Thr Ser Asp Leu Leu Ala Leu val Gin ser Trp Leu 
1300 1305 1310 

Ala Asp Glu Arg Phe Ala Glu Ser Gin Leu val val Val Thr Arg Ala 
1315 " 1320 1325 

Ala val Ser Ala Asp Ser Asp Ser Asp val Ala Asp Leu val Gly Ala 
1330 1335 1340 

Ser ser Trp Gly Leu Leu Ser Ser Ala Gin Ser Glu Asn Pro Gly Arg 
345 1350 1355 1360 

Phe val Leu val Asp val Asp Gly Thr Pro Glu Ser Trp Gin Ala Leu 
1365 1370 1375 

Pro Ala Ala val Arg Ala Gly Glu Pro Gin Leu Ala Leu Arg Arg Gly 
1380 1385 1390 

val Ala Leu val Pro Arg Leu Ala Arg Leu Thr val Arg Glu Glu Gly 
1395 1400 1405 

Ser Ser pro Gin Leu Asp Thr Asp Gly Thr Val Leu lie Thr Gly Gly 
1410 1415 1420 

Thr Gly Ala Leu Gly Gly Val Val Ala Arg His Leu val Glu Glu His 
425 1430 1435 1440 

Gly He Arg Arg Leu val Leu Ala Gly Arg Arg Gly Trp Asn Ala Pro 
1445 1450 ~ 1455 

Gly val His Glu Leu val Asp Glu Leu Ala Arg Ala Gly Ala val val 
1460 1465 1470 

Glu val val Ala Cys Asp val Ala Asp Arg Thr Asp Leu Glu His val 
1475 1480 1485 

Leu Ala Ala lie Pro val Asp Trp Pro Leu Arg Gly lie val His Thr 
1490 1495 1500 

Ala Gly val Leu Ala Asp Gly val lie Gly Ser Leu Ser Ala Ala Asp 
505 1510 1515 1520 

val Gly Thr val Phe Ala Pro Lys val Thr Gly Ala Trp His Leu His 
1525 1530 1535 

Glu Leu Thr Arg Asp Leu Asp Leu ser Phe Phe val Leu Phe Ser Ser 
1540 1545 1550 

Phe Ser Gly lie Ala Gly Ala Ala Gly Gin Ala Asn Tyr Ala Ala Ala 
1555 1560 1565 

Asn Thr Phe Leu Asp Ala Leu Ala Arg Tyr Arg Arg Ala Arg Gly Leu 
1570 1575 " 1580 

Pro Gly Leu Ser Leu Ala Trp Gly Leu Trp Ala Gin Pro Ser Gly Met 
585 1590 1595 1600 

Thr Ser Gly Leu Asp Ala Ala Ser Val Glu Arg Leu Ala Arg Thr Gly 
1605 1610 ~ 1615 
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lie Ala Glu Leu Ser Thr Glu Asp Gly Leu Arg Leu Phe Asp Ala Ala 
1620 1625 1630 

Phe Ala Lys Asp Arg Ala Cys Val val Ala Ala Arg Leu Asp Arg Ala 
1635 " 1640 1645 

Leu Leu Val Gly Asn Gly Arg ser His Ala lie Pro Ala Leu Leu Ser 
1650 1655 1660 

Ala Leu Val Pro val Arg Gly Gly val Ala Arg Lys Thr Ala Asn Ser 
665 1670 1675 1680 

Gin Ala Ala Asp Glu Asp Ala Leu Leu Gly Leu val Arg Glu His Val 
1685 1690 1695 

Ser Ala Val Leu Gly Tyr ser Gly Ala val Glu val Gly Gly Asp Arg 
1700 1705 1710 

Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Ser Gly val Glu Leu Arg 
1715 1720 1725 

Asn Arg Leu Ala Gly val Leu Gly val Arg Leu Pro Ala Thr Ala val 
1730 1735 ' 1740 

Phe Asp Tyr Pro Thr Pro Arg Ala Leu Ala Arg Phe Leu His Gin Glu 
745 1750 1755 1760 

Leu Ala Gly Glu val Ala ser Thr Ser Thr Pro val Thr Arg Ala Ala 
1765 1770 1775 

Ser Ala Glu Glu Asp Leu Val Ala lie val Gly Met Gly Cys Arg Phe 
1780 1785 1790 

Pro Gly Gly val Ser Ser Pro Glu Glu Leu Trp Arg Leu val Ala Gly 
1795 1800 1805 

Gly Val Asp Ala val Ala Gly Phe Pro Asp Asp Arg Gly Trp Asp Leu 
1810 1815 1820 

Ala Ala Leu Tyr Asp Pro Asp Pro Asp Arg Leu Gly Thr Ser Tyr Val 
825 1830 1835 1840 

Cys Glu Gly Gly Phe Leu Arg Asp Ala Ala Glu Phe Asp Ala Asp Met 
1845 " 1850 1855 

Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
1860 1865 1870 

Leu Leu Leu Glu Val Ala Trp Glu Thr Leu Glu Arg Ala Gly lie Asp 
1875 1880 1885 

Pro Phe Ser Leu His Gly Ser Arg Thr Gly val Phe Ala Gly Leu Met 
1890 1895 1900 

Tyr His Asp Tyr Gly Ala Arg Phe lie Thr Arg Ala Pro Glu Gly Phe 
905 1910 " 1915 1920 

Glu Gly His Leu Gly Thr Gly Asn Ala Gly Ser Val Leu Ser Gly Arg 
1925 1930 1935 

val Ala Tyr ser Phe Gly Phe Glu Gly Pro Ala val Thr val Asp Thr 
1940 1945 1950 
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Ala cys ser Ser ser Leu val Ala Leu His Leu Ala Gly Gin Ala Leu 
1955 1960 1965 

Arg Ala Gly Glu Cys Glu Phe Ala Leu Ala Gly Gly Val Thr Val Met 
1970 1975 1980 

Ser Thr Pro Thr Thr Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala 
985 1990 1995 2000 

Pro Asp Gly Arg Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly 
2005 2010 2015 

Trp Gly Glu Gly Ala Gly Leu val Leu Leu Glu Arg Leu Ser Asp Ala 
2020 2025 ~ 2030 

Arg Arg Asn Gly His Glu val Leu Ala val Val Arg Gly Ser Ala val 
2035 2040 2045 

Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser 
2050 2055 2060 

Gin Gin Arg val lie Thr Gin Ala Leu Thr Ser Ala Gly Leu Ser Val 
065 ~ 2070 2075 2080 

Ser Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly 
2085 2090 2095 

Asp Pro lie Glu Ala Gin Ala Leu lie Ala Thr Tyr Gly Arg Asp Arg 
2100 2105 2110 

Asp Pro Gly Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn lie Gly 
2115 2120 2125 

His Thr Gin Ala Ala Ala Gly val Ala Gly val lie Lys Met Val Met 
2130 2135 2140 

Ala Met Arg Gin Gly Glu Leu Pro Arg Thr Leu His val Asp Glu Pro 
145 2150 2155 2160 

Ser Ala Gin val Asp Trp ser Ala Gly Thr Val Gin Leu Leu Thr Glu 
2165 2170 2175 

Asn Thr Pro Trp Pro Asp ser Gly Arg Leu Arg Arg Ala Gly val Ser 
2180 2185 2190 

Ser Phe Gly lie Ser Gly Thr Asn Ala His Leu lie Leu Glu Gin Pro 
2195 2200 2205 

Pro Arg Glu Ser Gin Arg Ser Thr Glu Pro Asp Ser Gly Ser Val Arg 
2210 2215 2220 

Asp Phe Pro Val val Pro Trp Met Val Ser Gly Lys Thr Pro Glu Ala 
225 2230 2235 2240 

Leu Ser Ala Gin Ala Asp Ala Leu Met ser Tyr Leu Ser Asn Arg val 
2245 2250 2255 

Asp Ala Ser Pro Arg Asp lie Gly Tyr Ser Leu Ala val Thr Arg Pro 
2260 2265 2270 

Ala Leu Asp His Arg Ala val val Leu Gly Ala Asp Arg Ala Ala Leu 
2275 ~ 2280 2285 
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Leu Pro Gly Leu Lys Ala Leu Ala val Ser Asn Asp Ala Ala Glu val 
2290 2295 2300 

lie Thr Gly Thr Arg Ala Ala Gly Pro val Gly Phe val Phe Ser Gly 
305 2310 2315 2320 

Gin Gly Gly Gin Trp Pro Gly Met Gly Ser Gly Leu His Ser Ala Phe 
2325 2330 2335 

Pro val Phe Ala Asp Ala Phe Asp Glu Ala Cys Cys Glu Leu Asp Ala 
2340 2345 2350 

His Leu Gly Gin Met Ala Arg Leu Arg Asp val Leu ser Gly Ser Asp 
2355 2360 2365 

Thr Gin Leu Leu Asp Gin Thr Leu Trp Ala Gin Pro Gly Leu Phe Ala 
2370 2375 2380 

Leu Gin val Gly Leu Trp Glu Leu Leu Gly Ser Trp Gly val Arg Pro 
385 2390 2395 2400 

Ala val val Leu Gly His Ser val Gly Glu Leu Ala Ala Ala Phe Ala 
2405 2410 2415 

Ala Gly val Leu Ser Leu Arg Asp Ala Ala Arg Leu val Ala Gly Arg 
2420 ~ 2425 " 2430 

Ala Arg Leu Met Gin Ala Leu Pro Thr Gly Gly Ala Met Leu Ala Ala 
2435 2440 2445 

Ala Ala Gly Glu Glu Gin Leu Arg Pro Leu Leu Ala Asp Cys Gly Asp 
2450 2455 2460 

Arg val Gly lie Ala Ala val Asn Ala Pro Gly Ser val val Leu Ser 
465 2470 2475 2480 

Gly Asp Arg Asp Val Leu Asp Asp lie Ala Gly Arg Leu Asp Gly Gin 
2485 2490 2495 

Gly lie Arg ser Arg Trp Leu Arg val ser His Ala Phe His Ser His 
2500 2505 2510 

Arg Met Asp Pro Met Leu Ala Glu Phe Thr Glu lie Ala Arg Ser Val 
2515 2520 2525 

Asp Tyr Arg ser Ser Gly Leu Pro lie Val Ser Thr Leu Thr Gly Glu 
2530 2535 2540 

Leu Asp Glu val Gly Met Pro Ala Thr Pro Glu Tyr Trp val Arg Gin 
545 2550 2555 2560 

val Arg Glu Pro val Arg Phe Ala Asp Gly val Ala Ala Leu Ala Ala 
2565 2570 2575 

His Gly val ser Thr val val Glu val Gly Pro Asp Gly val Leu Ser 
2580 2585 2590 

Ala Leu Val Gin Glu Cys Ala Ala Gly Ser Asp Gin Gly Gly Arg val 
2595 2600 2605 

Ala Ala Val Pro Leu Met Arg Ser Asn Arg Asp Glu Ala His Thr Val 
2610 2615 2620 
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Thr Thr Ala Leu Ala Gin lie His Val Arg Gly Ala Glu Val Asp Trp 
625 2630 2635 2640 

Arg Ser Phe Phe Ala Gly Thr Gly Ala Lys Gin Val Glu Leu Pro Thr 
2645 2650 2655 

Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Asp Ser Pro Ser Glu Pro 
2660 2665 2670 

Val Gly Gin Ser Ala Asp Pro Ala Arg Gin Ser Gly Phe Trp Glu Leu 
2675 2680 2685 

val Glu Gin Glu Asp val Ser Ala Leu Ser Ala Ala Leu His lie Thr 
2690 2695 2700 

Gly Asp His Asp Val Gin Ala Ser Leu Glu Ser Val Val Pro val Leu 
705 2710 2715 2720 

Ser Ser Trp His Arg Arg lie Arg Asn Glu Ser Leu val His Gin Trp 
2725 2730 2735 

Arg Tyr Arg lie Ser Trp His Glu Arg Ala Asp Leu Pro Asp Pro Ser 
2740 2745 2750 

Leu ser Gly Thr Trp Leu val val val Pro Glu Gly Trp Ser Ala Ser 
2755 2760 2765 

Arg Gin val Leu Arg Phe Asn Glu Met Phe Glu Glu Arg Gly Cys Pro 
2770 ~ 2775 2780 

Ala val Leu Phe Glu Leu Ala Gly His Asp Glu Glu Ala Leu Ala Gin 
785 2790 2795 2800 

Arg Phe Arg Ser Leu Pro Val Ala Ser Gly Gly lie Ser Gly Val Leu 
2805 2810 2815 

Ser Leu Leu Ala Leu Asp Glu Ser pro Ser Ser Pro Asn Ala Ala Leu 
2820 2825 2830 

Pro Asn Gly Ala Leu Asn Ser Leu Val Leu Leu Arg Ala Leu Arg Ala 
2835 2840 2845 

Ala Asp Val Ser Ala Pro Leu Trp Leu Ala Thr Cys Gly Gly Val Ala 
2850 2855 2860 

Val Gly Asp val Pro Val Asn Pro Gly Gin Ala Leu val Trp Gly Leu 
865 2870 2875 2880 

Gly Arg Val Val Gly Leu Glu His Pro Ala Trp Trp Gly Gly Leu val 
2885 2890 2895 

Asp Val Pro Cys Leu Leu Asp Glu Asp Ala Arg Glu Arg Leu Ser val 
2900 2905 2910 

val Leu Ala Gly Leu Gly Glu Asp Glu lie Ala val Arg Pro Gly Gly 
2915 2920 2925 

Val Phe Val Arg Arg Leu Glu Arg Ala Gly Ala Ala Ser Gly Ala Gly 
2930 2935 2940 

Ser Val Trp Arg Pro Arg Gly Thr Val Leu val Thr Gly Gly Thr Gly 
945 2950 2955 2960 
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Gly Leu Gly Ala His val Ala Arg Trp Leu Ala Gly Ala Gly Ala Glu 
2965 2970 2975 

His val val Leu Thr ser Arg Arg Gly Ala Ala Ala Pro Gly Ala Gly 
2980 2985 2990 

Asp Leu Arg Ala Glu Leu Glu Ala Leu Gly Ala Arg val Ser lie Thr 
2995 3000 3005 

Ala cys Asp val Ala Asp Arg Asp Ala Leu Ala Glu val Leu Ala Thr 
3010 3015 3020 

lie Pro Asp Asp Cys Pro Leu Thr Ala val Met His Ala Ala Gly val 
025 3030 3035 3040 

val Glu val Gly Asp val Ala Ser Met Cys Leu Thr Asp Phe val Gly 
3045 3050 3055 

val Leu ser Ala Lys Ala Gly Gly Ala Ala Asn Leu Asp Glu Leu Leu 
3060 3065 3070 

Ala Asp val Glu Leu Asp Ala Phe val Leu Phe ser ser val ser Gly 
3075 3080 3085 

val Trp Gly Ala Gly Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Tyr 
3090 3095 3100 

Leu Asp Ala Leu Ala Gin Gin Arg Arg Ala Arg Gly Leu Val Gly Thr 
105 3110 ~ " 3115 ' 3120 

Ala val Ala Trp Gly Pro Trp Ala Gly Asp Gly Met Ala Ala Gly Glu 
3125 3130 3135 

Gly Gly Ala Gin Leu Arg Arg Ala Gly Leu Val Pro Met Ala Ala Asp 
3140 ~ 3145 3150 

Arg Ala Leu Leu Ala Leu Gin Gly Ala Leu Asp Arg Asp Glu Thr Ser 
3155 3160 3165 

Leu val val Ala Asp Met Ala Trp Glu Arg Phe Ala Pro val Phe Ala 
3170 3175 3180 

Met Ser Arg Arg Arg Pro Leu Leu Asp Glu Leu Pro Glu Ala Gin Gin 
185 " 3190 3195 3200 

Ala Leu Ala Asp Ala Glu Asn Thr Thr Asp Ala Ala Asp Ser Ala val 
3205 3210 3215 

Pro Leu Pro Arg Leu Ala Gly Met Ala Ala Ala Glu Arg Arg Arg Ala 
3220 3225 3230 

Met Leu Asp Leu Val Leu Ala Glu Ala Ser lie val Leu Gly His Asn 
3235 3240 3245 

Gly Ser Asp Pro Val Gly Pro Asp Arg Ala Phe Gin Glu Leu Gly Phe 
3250 3255 " 3260 

Asp Ser Leu Met Ala val Glu Leu Arg Asn Arg Leu Gly Glu Ala Thr 
265 3270 ' 3275 3280 

Gly Leu Ser Leu Pro Ala Thr Leu lie Phe Asp Tyr Pro Ser Pro Ser 
3285 3290 ' 3295 
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Ala Leu Ala Glu Gin Leu val Gly Glu Leu val Gly Ala Gin Pro Ala 
3300 3305 ' 3310 

Thr Thr val val Ala Gly Ala Asp Pro val Asp Asp Pro val val val 
3315 3320 3325 

val Ala Met Gly cys Arg Tyr Pro Gly Asp val cys Ser pro Glu Glu 
3330 3335 3340 

Leu Trp Gin Leu Val Ser Ala Gly Arg Asp Ala val Ser Thr Phe Pro 
345 3350 ' 3355 3360 

val Asp Arg Gly Trp Asp Cys Asn Thr Leu Phe Asp Pro Asp Pro Asp 
3365 3370 3375 

Arg Ala Gly ser Thr Tyr val Arg Glu Gly Ala Phe Leu Thr Gly Ala 
3380 3385 3390 

Asp Arg Phe Asp Ala Gly Phe Phe Gly lie Ser Pro Arg Glu Ala Arg 
3395 3400 3405 

Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu val Ala Trp Glu val 
3410 3415 3420 

Phe Glu Arg Ala Gly lie Ala Pro Leu Ser Leu Arg Gly Ser Arg Thr 
425 ' 3430 3435 3440 

Gly val Phe Ala Gly Thr Asn Gly Gin Asp His Gly Ala Lys Val Ala 
3445 3450 3455 

Ala Ala Pro Glu Ala Ala Gly His Leu Leu Thr Gly Asn Ala Ala Ser 
3460 3465 3470 

val Leu Ala Gly Arg Leu Ser Tyr Thr Phe Gly Leu Glu Gly Pro Ala 
3475 3480 3485 

val Ala val Asp Thr Ala Cys ser Ser ser Leu Val Ala Leu His Leu 
3490 3495 3500 

Ala Cys Gin Ser Leu Arg Ser Gly Glu Cys Asp Met Ala Leu Ala Gly 
505 3510 3515 3520 

Gly Val Thr val Met Ser Thr Pro Leu Ala Phe Leu Glu Phe Ser Arg 
3525 3530 3535 

Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala Ala Ala 
3540 3545 3550 

Ala Asp Gly Thr Gly Trp Gly Glu Gly Ala Gly Leu val Leu Leu Glu 
3555 3560 3565 

Arg Leu Ser Asp Ala Arg Arg Asn Gly His Arg val Leu Ala val val 
3570 3575 3580 

Arg Gly ser Ala val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala 
585 3590 3595 3600 

Pro Asn Gly Pro Ser Gin Gin Arg Val lie Arg Gin Ala Leu Ala Asn 
3605 " 3610 3615 

Ala Gly Leu Ser Ala Ser Asp val Asp val val Glu Ala His Gly Thr 
3620 3625 3630 

Page 202 



LeA33955-US 

Gly Thr Gly Leu Gly Asp Pro lie Glu Ala Gin Ala Leu lie Ala Thr 
3635 3640 3645 

Tyr Gly Gin Glu Arg Asp Pro Glu Arg Ala Leu Trp Leu Gly Ser lie 
3650 ~ 3655 ~ 3660 

Lys Ser Asn He Gly His Thr Gin Ala Ala Ala Gly val Ala Gly val 
665 3670 3675 3680 

lie Lys Met Val Gin Ala Met Arg His Gly Glu Leu Pro Ala Thr Leu 
3685 3690 3695 

His val Asp Lys Pro Thr Pro Gin val Asp Trp ser Ala Gly Ala val 
3700 3705 3710 

Arg Leu Leu Thr Gly Asn Thr Pro Trp Pro Glu Ser Gly Arg Pro Arg 
3715 3720 3725 

Arg Ala Gly val Ser ser Phe Gly lie Ser Gly Thr Asn Ala His Leu 
3730 3735 3740 

lie Leu Glu Gin Pro Pro Ser Glu Pro Ala Glu He Asp Gin Ser Asp 
745 3750 3755 3760 

Arg Arg val Thr Ala His Pro Ala val lie Pro Trp Met Leu Ser Ala 
3765 3770 3775 

Arg Ser Leu Ala Ala Leu Gin Ala Gin Ala Ala Ala Leu Gin Ala Arg 
3780 3785 3790 

Leu Asp Arg Gly Pro Gly Ala Ser Pro Leu Asp Leu Gly Tyr Ser Leu 
3795 3800 3805 

Ala Thr Thr Arg ser val Leu Asp Glu Arg Ala val val Trp Gly Ala 
3810 3815 3820 

Asp Arg Glu Ala Leu Leu Ser Arg Leu Ala Ala Leu Ala Asp Gly Arg 
825 ^ 3830 3835 3840 

Thr Ala Pro Gly val lie Thr Gly ser Ala Asn Ser Gly Gly Arg lie 
3845 3850 3855 

Gly Phe val Phe Ser Gly Gin Gly Ser Gin Trp Leu Gly Met Gly Lys 
3860 3865 3870 

Ala Leu cys Ala Ala Phe Pro Ala Phe Ala Asp Ala Phe Glu Glu Ala 
3875 3880 3885 

Cys Asp Ala Leu Ser Ala His Leu Gly Ala Asp Val Arg Gly val Leu 
3890 3895 3900 

Phe Gly Ala Asp Glu Gin Met Leu Asp Arg Thr Leu Trp Ala Gin Ser 
905 3910 3915 3920 

Gly lie Phe Ala val Gin val Gly Leu Leu Gly Leu Leu Arg Ser Trp 
3925 3930 3935 

Gly val Arg Pro Ala Ala Val Leu Gly His Ser val Gly Glu Leu Ala 
3940 3945 3950 

Ala Ala His Ala Ala Gly val Leu Ser Leu Pro Asp Ala Ala Arg Leu 
3955 3960 3965 
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val Ala Ala Arg Ala His Leu Met Gin Ala Leu Pro Thr Gly Gly Ala 
3970 3975 3980 

Met Leu Ala val Ala Thr ser Glu Ala Ala val Gly Pro Leu Leu ser 
985 3990 3995 4000 

Gly val Cys Asp Arg val Ser lie Ala Ala lie Asn Gly Pro Glu Ser 
4005 4010 4015 

val Val Leu Ser Gly Asp Arg Asp val Leu val Glu Leu Ala Gly Glu 
4020 4025 4030 

Phe Asp Ala Arg Gly Leu Arg Thr Lys Trp Leu Arg val Ser His Ala 
4035 4040 4045 

Phe His Ser His Arg Met Glu Pro lie Leu Asp Glu Tyr Ala Glu Thr 
4050 4055 4060 

Ala Arg cys val Glu Phe Gly Glu Pro val val Pro lie val Ser Ala 
065 " 4070 4075 4080 

Ala Thr Gly Ala Leu Asp Thr Thr Gly Leu Met Cys Ala Ala Asp Tyr 
4085 4090 4095 

Trp Thr Arg Gin val Arg Asp Pro Val Arg Phe Gly Asp Gly Val Arg 
4100 ^ 4105 4110 

Ala Leu Val Gly Gin Gly Val Asp Thr lie val Glu Phe Gly Pro Asp 
4115 4120 4125 

Gly Ala Leu Ser Ala Leu Val Glu Gin Cys Leu Ala Gly Ser Asp Gin 
4130 4135 4140 

Ala Gly Arg val Ala Ala lie Pro Leu Met Arg Arg Asp Arg Asp Glu 
145 ^ 4150 4155 4160 

val Glu Thr Ala val Ala Ala Leu Ala His val His Val Arg Gly Gly 
4165 4170 4175 

Ala Val Asp Trp ser Ala Cys Phe Ala Gly Thr Gly Ala Arg Thr Val 
4180 4185 4190 

Glu Leu Pro Thr Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Ala Gly 
4195 4200 4205 

Gin Ala Asp Gly Arg Gly Gly Asp val val Ala Asp Pro Val Asp Ala 
4210 4215 4220 

Arg Phe Trp Glu Leu val Glu Arg Ala Asp Pro Glu Pro Leu Val Asp 
225 4230 4235 4240 

Glu Leu cys lie Asp Arg Asp Gin Pro Phe Arg Glu Val Leu Pro Val 
4245 4250 4255 

Leu Ala Ser Trp Arg Glu Lys Gin Arg Gin Glu Ala Leu Ala Asp Ser 
4260 4265 4270 

Trp Arg Tyr Gin Val Arg Trp Arg Ser Val Glu Val Pro Ser Ala Ala 
4275 4280 4285 

Ala Leu Arg Gly Val Trp Leu Val Val Leu Pro Ala Asp Val Pro Arg 
4290 ~ 4295 4300 
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Asp Gin Pro Ala val val He Asp Ala Leu lie Ala Arg Gly Ala Glu 
305 4310 4315 4320 

val Ala Val Leu Glu Leu Thr Glu Gin Asp Leu Gin Arg Ser Ala Leu 
4325 4330 4335 

val Asp Lys val Arg Ala val He Ala Asp Arg Thr Glu val Thr Gly 
4340 4345 ~ 4350 

Val Leu ser Leu Leu Ala Met Asp Gly Met Pro Cys Ala Ala His Pro 
4355 4360 4365 

His Leu ser Arg Gly Val Ala Ala Thr Val lie Leu Thr Gin Val Leu 
4370 4375 4380 

Gly Asp Ala Gly val Ser Ala Pro Leu Trp Leu Ala Thr Thr Gly Gly 
385 4390 4395 4400 

val Glu Ala Gly Thr Glu Asp Gly Pro Ala Asp Pro Asp His Gly Leu 
4405 " 4410 4415 

lie Trp Gly Leu Gly Arg val val Gly Leu Glu His Pro Gin Trp Trp 
4420 ' " 4425 4430 

Gly Gly Leu lie Asp Leu Pro Glu Thr Leu Asp Glu Thr Ser Arg Asn 
4435 4440 4445 

Gly Leu val Ala Ala Leu Ala Gly Thr Ala Ala Glu Asp Gin Leu Ala 
4450 4455 4460 

val Arg Ser ser Gly Leu Phe Val Arg Arg Val Val Arg Ala Ala Arg 
465 4470 ~ 4475 " 4480 

Asn Pro Arg Ser Glu Thr Trp Arg Ser Arg Gly Thr val Leu lie Thr 
4485 " 4490 4495 

Gly Gly Thr Gly Ala Leu Gly Ala Glu val Ala Arg Trp Leu Ala Arg 
4500 4505 4510 

Arg Gly Ala Glu His Leu val Leu lie Ser Arg Arg Gly Pro Glu Ala 
4515 4520 4525 

Pro Gly Ala Ala Asp Leu Gly Ala Glu Leu Thr Glu Leu Gly Val Lys 
4530 4535 4540 

Val Thr Val Leu Ala Cys Asp Val Thr Asp Arg Asp Glu Leu Ala Ala 
545 4550 4555 4560 

val Leu Ala Ala val Pro Thr Glu Tyr Pro Leu ser Ala val val His 
4565 4570 4575 

Thr Ala Gly val Gly Thr Pro Ala Asn Leu Ala Glu Thr Thr Leu Ala 
4580 4585 4590 

Gin Phe Ala Asp val Leu ser Ala Lys val val Gly Ala Ala Asn Leu 
4595 4600 4605 

Asp Arg Leu Leu Gly Gly Gin Pro Leu Asp Ala Phe val Leu Phe Ser 
4610 4615 4620 

Ser lie Ser Gly Val Trp Gly Ala Gly Gly Gin Gly Ala Tyr Ser Ala 
625 4630 ' 4635 " 4640 
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Ala Asn Ala Tyr Leu Asp Ala Leu Ala Glu Arg Arg Arg Ala Cys Gly 
4645 4650 ^ 4655 

Arg Pro Ala Thr Cys lie Ala Trp Gly Pro Trp Ala Gly Ala Gly Met 
4660 4665 4670 

Ala val Gin Glu Gly Asn Glu Ala His Leu Arg Arg Arg Gly Leu val 
4675 4680 ~ 4685 

Pro Met Glu Pro Gin Ser Ala Leu Phe Ala Leu Gin Gin Ala Leu Ser 
4690 4695 4700 

Gin Arg Glu Thr Ala lie Thr Val Ala Asp Val Asp Trp Glu Arg Phe 
705 4710 4715 4720 

Ala Ala Ser Phe Thr Ala Ala Arg Pro Arg Pro Leu Leu Glu Glu lie 
4725 ~ 4730 4735 

val Asp Leu Arg Pro Asp Thr Glu Thr Glu Glu Lys His Gly Ala Gly 
4740 4745 4750 

Glu Leu Gly Gin Gin Leu Ala Ala Leu Pro Pro Ala Glu Arg Gly His 
4755 4760 4765 

Leu Leu Leu Glu val Val Leu Ala Glu Thr Ala Ser Thr Leu Gly His 
4770 4775 4780 

Asp Ser Ala Glu Ala Val Gin Pro Asp Arg Thr Phe Ala Glu Leu Gly 
785 4790 4795 4800 

Phe Asp Ser Leu Thr Ala val Glu Leu Arg Asn Arg Leu Asn Ala val 
4805 4810 4815 

Thr Gly Leu Arg Leu Pro Pro Thr Leu val Phe Asp His Pro Thr Pro 
4820 4825 4830 

Leu Ala Leu Ser Glu Gin Leu Val Pro Ala Leu val Ala Glu Pro Asp 
4835 4840 4845 

Asn Gly lie Glu ser Leu Leu Ala Glu Leu Asp Arg Leu Asp Thr Thr 
4850 4855 4860 

Leu Ala Gin Gly Pro Ser lie Pro Leu Glu Asp Gin Ala Lys Val Ala 
865 4870 4875 4880 

Glu Arg Leu His Ala Leu Leu Ala Lys Trp Asp Gly Ala Arg Asp Gly 
4885 4890 4895 

Thr Ala Arg Ala Thr ser Pro Gin Ser Leu Thr Ala Ala Thr Asp Asp 
4900 4905 4910 

Glu lie Phe Asp Leu lie Asp Arg Lys Phe Arg Arg 
4915 4920 



<210> 49 
<211> 16767 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 
<221> CDS 
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<222> (1).. (16764) 

<223> ORF22; polyketide synthase 

<400> 49 

atg gcc aat gaa gaa aag etc cgc gag tac etc aag cgt gtc gtc gtc 48 
Met Ala Asn Glu Glu Lys Leu Arg Glu Tyr Leu Lys Arg val val val 
1 5 10 15 

gaa ctg gaa gag gcg cac gaa cgc ctg cac gag ttg gag cgc cag gag 96 
Glu Leu Glu Glu Ala His Glu Arg Leu His Glu Leu Glu Arg Gin Glu 
20 25 30 

cac gac ccc ate gcg ate gtg teg atg gga tgt cgt tat ccc ggt ggc 144 
His Asp Pro lie Ala lie val Ser Met Gly Cys Arg Tyr Pro Gly Gly 
35 40 45 

gtc tec act ccg gag gag ctg tgg cga ctg gtc gtc gac gga gga gac 192 
val Ser Thr Pro Glu Glu Leu Trp Arg Leu val val Asp Gly Gly Asp 
50 55 60 

gcg ate gcg aac ttc ccc gaa gac cgt gqc tgg aat ctg gac gag ctg 240 
Ala lie Ala Asn Phe Pro Glu Asp Arg Gly Trp Asn Leu Asp Glu Leu 
65 70 75 80 

ttc gat cct gat ccg ggc cga gcc ggg acc tec tac gtc cgc gag ggt 288 
Phe Asp Pro Asp Pro Gly Arg Ala Gly Thr Ser Tyr Val Arg Glu Gly 
85 90 95 

ggt ttc ctg cgc ggg gtc gcg gac ttc gat gcc ggg etc ttc ggg ate 336 
Gly Phe Leu Arg Gly val Ala Asp Phe Asp Ala Gly Leu Phe Gly lie 
100 105 110 

agt ccg cgc gag gca cag gcg atg gac ccg caa cag egg ttg ctg ctg 384 
Ser Pro Arg Glu Ala Gin Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 

gag ate teg tgg gag gtg ttc gag cgc gcc ggc att gac ccg ttt tct 432 
Glu lie Ser Trp Glu Val Phe Glu Arg Ala Gly lie Asp Pro Phe Ser 
130 135 140 

ttg egg ggt acc aag acc ggt gtg ttc gcg ggc ctg ate tac cac gac 480 
Leu Arg Gly Thr Lys Thr Gly val Phe Ala Gly Leu lie Tyr His Asp 
145 150 155 160 

tac gcg teg egg ttt cgc aag acc ccc gcg gag ttc gag ggt tac ttc 528 
Tyr Ala Ser Arg Phe Arg Lys Thr Pro Ala Glu Phe Glu Gly Tyr Phe 
165 170 175 

gcc acc ggc aac gcg ggc age gtc gca tec ggc egg gtg get tac acc 576 
Ala Thr Gly Asn Ala Gly Ser val Ala ser Gly Arg val Ala Tyr Thr 
180 185 190 

ttc ggg tta gag gqc ccg gcg gtc acc gtg gac acc gcc tgc teg teg 624 
Phe Gly Leu Glu Gly Pro Ala val Thr Val Asp Thr Ala Cys Ser Ser 
195 200 205 

tec ctg gtg gcg ctg cac ctg gcc tgc cag tec ctg egg ctg ggc gaa 672 
Ser Leu Val Ala Leu His Leu Ala Cys Gin Ser Leu Arg Leu Gly Glu 
210 215 220 

tgc gac ctg gcc ctg gcc ggt ggc att teg gtg atg gcc acg ccg gga 720 

Cys Asp Leu Ala Leu Ala Gly Gly lie Ser Val Met Ala Thr Pro Gly 
225 230 235 240 
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gcc ttc gtc gag ttc age egg caa cgc gca etc gec teg gat ggc egg 768 

Ala Phe Val Glu Phe Ser Arg Gin Arg Ala Leu Ala Ser Asp Gly Arg 

245 " 250 255 

tgc aag ccc ttc gcg gat gcc gcc gac gqc acc gqc tgg gqc gag gqc 

Cys Lys Pro Phe Ala Asp Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly 

260 265 270 



cac ccg gtg ctg gcg gcg gtg gtc ggt tec gcg ate aac cag gac gqg 

His Pro val Leu Ala Ala val Val Gly Ser Ala lie Asn Gin Asp Gly 
290 295 300 

acg tec aac gqc ctg acc gcg ccc age ggt ccc gca cag cag cga gtq 

Thr Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg Val 

305 310 315 320 



gtq gtc gag gcg cac gqc acg gqc acg gcc ttg gqc gac ccg ate gag 
val val Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro lie Glu 
340 345 350 



816 



gcc gqa atg ctg ctg ctg gaa egg ctg teg gac gca cga cga aac gqc 864 
Ala Gly Met Leu Leu Leu Glu Arg Leu ser Asp Ala Arg Arg Asn Gly 
275 280 285 



912 



960 



ate cgc caa gcc ctg gcg aac gcc gqg ttg teg ccc gcc gag gtc gat 1008 
lie Arg Gin Ala Leu Ala Asn Ala Gly Leu Ser Pro Ala Glu val Asp 
325 330 335 



1056 



gcg cag gcc ctg ate gcc acc tac gqg gcg aac egg teg gcg gat cat 1104 

Ala Gin Ala Leu lie Ala Thr Tyr Gly Ala Asn Arg Ser Ala Asp His 
355 360 365 

ccg ctg ctg ctg ggt tec etc aag teg aac ate ggc cac acc cag get 1152 

Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn lie Gly His Thr Gin Ala 
370 375 380 

gcc gcc ggt gtg gcc ggg gtg ate aag teg gtc ctg gcc ate agg cac 

Ala Ala Gly Val Ala Gly Val lie Lys Ser val Leu Ala lie Arg His 

385 390 395 400 



1200 



egg gag atg ccc cgc age ctg cac ate gac cag cca teg cag cac gtg 1248 

Arg Glu Met Pro Arg Ser Leu His lie Asp Gin Pro Ser Gin His Val 

405 410 415 

gac tgg teg gcg ggc gcg gtg egg ctg etc acg gac age gtt gac tgg 1296 

Asp Trp Ser Ala Gly Ala Val Arg Leu Leu Thr Asp Ser Val Asp Trp 

420 425 430 

ccg gat etc ggc agg ccg cgc cga gca ggg gtg tec teg ttc ggc atg 1344 

Pro Asp Leu Gly Arg Pro Arg Arg Ala Gly val Ser Ser Phe Gly Met 

435 440 445 

age ggt acc aac gca cac ctg ate gtc gag gaa gta tec gac gag ccg 

Ser Gly Thr Asn Ala His Leu lie Val Glu Glu val Ser Asp Glu Pro 

450 455 460 



1392 



gtc teg ggc agt acc gag ccg acc ggg gca ttt ccc tgg ccg ctg tec 1440 
Val Ser Gly Ser Thr Glu Pro Thr Gly Ala Phe Pro Trp Pro Leu Ser 
465 470 475 480 

ggc aag acg gag acg gca ttg cgc gag cag get gcc gag ttg etc tec 1488 
Gly Lys Thr Glu Thr Ala Leu Arg Glu Gin Ala Ala Glu Leu Leu Ser 
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485 490 495 

gta gtq acc gag cac ccg gag ccg gqa ctg gqg gac gtc gqg tac teg 1536 
Val val Thr Glu His Pro Glu Pro Gly Leu Gly Asp val Gly Tyr Ser 
500 505 * 510 



ctg gec acc gqt cgc get gcg atg gag cac egg get gtc gtq gtt gec 
Leu Ala Thr Gly Arg Ala Ala Met Glu His Arg Ala val val val Ala 
515 520 " 525 



ctt cgc gqt gat ccc gat gca ccc gcg ctg gat cgc gac gac gtq att 
Leu Arg Gly Asp Pro Asp Ala Pro Ala Leu Asp Arg Asp Asp val lie 
610 615 620 



teg tgc gqa gtq gag ccc gee gcg gtq gtc gqt cat tec cag gqc gag 

Ser Cys Gly Val Glu Pro Ala Ala Val Val Gly His Ser Gin Gly Glu 

645 650 655 

ate gee gee gee cat gtq gca gqc get ttg tec ttg act gat gcg gtq 

lie Ala Ala Ala His val Ala Gly Ala Leu Ser Leu Thr Asp Ala Val 

660 665 670 



gqa gqc atg etc gcg att gee ttg ccg gaa age gcg gtq gtq aag cga 
Gly Gly Met Leu Ala He Ala Leu Pro Glu Ser Ala Val Val Lys Arg 
690 695 700 

ate gca gqc ctg ccg gag ctg acc gtt gcg gcg gtc aac gqa ccc gqc 
lie Ala Gly Leu Pro Glu Leu Thr Val Ala Ala val Asn Gly Pro Gly 
705 710 715 720 



1584 



gac gat egg gac tct ttc gtc gec gqa ctg acg gcg ttg get gcg ggc 1632 
Asp Asp Arg Asp Ser Phe val Ala Gly Leu Thr Ala Leu Ala Ala Gly 
530 535 540 

gtt ccg gca gec aac gtq gtq cag gqc gcg gec gac tgc aag gqa aag 1680 
val Pro Ala Ala Asn val Val Gin Gly Ala Ala Asp Cys Lys Gly Lys 
545 550 555 560 

gtc gcg ttc gtq ttc ccc gqc cag gqc teg cat tgg cag gqg atg gcg 1728 
val Ala Phe Val Phe Pro Gly Gin Gly ser His Trp Gin Gly Met Ala 
565 570 575 

agg gaa ctg tec gaa tec teg ccg gtq ttc egg egg aag ctg gcg gaa 1776 
Arg Glu Leu Ser Glu Ser Ser Pro Val Phe Arg Arg Lys Leu Ala Glu 
580 585 590 

tgc gcg gcg get acg gec cct tac gtq gac tgg teg ctg etc gqc gtc 1824 
Cys Ala Ala Ala Thr Ala Pro Tyr val Asp Trp Ser Leu Leu Gly val 
595 600 605 



1872 



cag etc gcg ctg ttc gee atg atg gtq teg ctg gec gaa ctg tgg cgt 1920 
Gin Leu Ala Leu Phe Ala Met Met Val Ser Leu Ala Glu Leu Trp Arg 
625 630 635 640 



1968 



2016 



cgc ate ate get gec cgc tgc gat gcg gtq teg gcg ctg acc gqg aag 2064 
Arg lie lie Ala Ala Arg Cys Asp Ala Val Ser Ala Leu Thr Gly Lys 
675 680 685 



2112 



2160 



tec act gtc gtt tec gqc gaa ccg teg get ctg gag cgt ctg cag acc 2208 

Ser Thr val val Ser Gly Glu Pro Ser Ala Leu Glu Arg Leu Gin Thr 

725 730 ~ 735 

gaa ctg acc gcg gaa aac gtg cag acc egg egg gtg gga att gat tac 2256 
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Glu Leu Thr Ala Glu Asn Val Gin Thr Arg Arg val Gly lie Asp Tyr 
740 745 750 

gcc teg cat teg ccg cag ate gcg cag gtc cag ggc egg ctt ctg gac 2304 
Ala ser His Ser Pro Gin lie Ala Gin Val Gin Gly Arg Leu Leu Asp 
755 760 765 

egg ctg ggc gaa gtc ggg tec gaa cct get gag ate get ttc tac teg 2352 
Arg Leu Gly Glu val Gly Ser Glu Pro Ala Glu lie Ala Phe Tyr Ser 
770 775 780 

acg gtc ace ggc gag egg acg gac ace ggc cga etc gac gcc gac tac 
Thr val Thr Gly Glu Arg Thr Asp Thr Gly Arg Leu Asp Ala Asp Tyr 
785 790 795 800 



egg atg gca gat cag ggc tat egg ttc ttc gtc gag gtg age ccg cac 
Arg Met Ala Asp Gin Gly Tyr Arg Phe Phe Val Glu val Ser Pro His 
820 825 830 



ggc ggg gtg gtg gtc ggt teg ctg egg cgt ggc gag ggc ggc tec egg 
Gly Gly val val Val Gly Ser Leu Arg Arg Gly Glu Gly Gly Ser Arg 
850 855 ' 860 



ccc ctg etc ggg get gcg gtg acg ctg gcc gat gcg ggc ggg ttc ctg 
Pro Leu Leu Gly Ala Ala Val Thr Leu Ala Asp Ala Gly Gly Phe Leu 
930 935 940 



gtg gtc ggc ggg gcg ate ctg ctg ccc ggc acc gcg ttc gtg gaa atg 
val val Gly Gly Ala He Leu Leu Pro Gly Thr Ala Phe val Glu Met 
965 970 975 



2400 



tgg tac cag aac ctt egg cag ccc gtc cgc ttc cag cag acc gtc gcc 2448 
Trp Tyr Gin Asn Leu Arg Gin Pro Val Arg Phe Gin Gin Thr val Ala 
805 810 815 



2496 



ccg ctg etc acc gcc gga ate cag gaa acg ctg gaa gcc gcg gac gcg 2544 
Pro Leu Leu Thr Ala Gly He Gin Glu Thr Leu Glu Ala Ala Asp Ala 
835 840 845 



2592 



cgc tgg ctg act teg ctg gcc gag tgc cag gtg cgc gga ctg ccg gtg 2640 

Arg Trp Leu Thr Ser Leu Ala Glu Cys Gin val Arg Gly Leu Pro val 

865 870 875 ' 880 

aat tgg gaa cag gta ttc etc aac acc gga gcc cga cgc gtg ccg ctg 2688 

Asn Trp Glu Gin Val Phe Leu Asn Thr Gly Ala Arg Arg Val Pro Leu 

885 890 895 

ccg acc tac ccg ttc cag egg cag egg tac tgg ttg gag tec gcc gag 2736 

pro Thr Tyr Pro Phe Gin Arg Gin Arg Tyr Trp Leu Glu Ser Ala Glu 

900 905 910 

tac gac gcg ggc gat etc ggt teg gtg ggc ttg etc tec gcc gag cat 2784 

Tyr Asp Ala Gly Asp Leu Gly Ser Val Gly Leu Leu Ser Ala Glu His 

915 920 925 



2832 



ctg acc ggc aag ctg teg gtc aag acc cag ccc tgg ttg gcc gac cac 2880 
Leu Thr Gly Lys Leu Ser val Lys Thr Gin Pro Trp Leu Ala Asp His 
945 950 955 960 



2928 



ctg ata cgc gcc gcg gac cag gtc ggg tgc gat ctg ate gag gag ttg 2976 
Leu lie Arg Ala Ala Asp Gin Val Gly Cys Asp Leu lie Glu Glu Leu 
980 985 990 
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tec ctg acg act ccg ctg gtt ttg ccc gcg acc ggt gcg gtg cag gtg 
ser Leu Thr Thr Pro Leu Val Leu Pro Ala Thr Gly Ala Vat Gin Val 
995 1000 1005 

cag ate gcg gtt ggc ggt ccg gac gag gec ggg cgc cgc teg gtc cgc 
Gin lie Ala val Gly Gly Pro Asp Glu Ala Gly Arg Arg Ser Val Arg 
1010 1015 1020 

gtq cat tec tgt cga gac gac gee gtq ccg cag gac teg tgg acc tgc 
val His Ser Cys Arg Asp Asp Ala val Pro Gin Asp Ser Trp Thr Cys 
1025 1030 1035 1040 



caa gee gtq etc tec ate gat teg ctg gtq ctg cgc cag att teg gac 
Gin Ala val Leu Ser lie Asp Ser Leu Val Leu Arg Gin lie Ser Asp 
1185 1190 1195 1200 



gtc gac tgg aag cga ate teg ccc gqc get gee aag ccg gtc tec tgg 
val Asp Trp Lys Arg lie Ser Pro Gly Ala Ala Lys Pro Val Ser Trp 
1220 ~ 1225 1230 



3024 



3072 



3120 



cac gcg acc gqc acg ttg acc tec age gat cac cag gac gee gqc cag 3168 

His Ala Thr Gly Thr Leu Thr Ser Ser Asp His Gin Asp Ala Gly Gin 

1045 1050 1055 

gqc ccc gat gqg att tgg ccg ccc aac gat get gtc gcg gtt ccg ctg 3216 

Gly pro Asp Gly lie Trp Pro Pro Asn Asp Ala Val Ala val Pro Leu 
1060 1065 1070 

gac age ttc tac gec cgc gca get gag egg gqc ttc gat ttc gqc ccg 3264 

Asp Ser Phe Tyr Ala Arg Ala Ala Glu Arg Gly Phe Asp Phe Gly Pro 
1075 1080 1085 

gcg ttc cag gqg ttg cag gcg get tgg aag cgc gga gac gag ate ttc 3312 

Ala Phe Gin Gly Leu Gin Ala Ala Trp Lys Arg Gly Asp Glu lie Phe 
1090 1095 1100 

gee gag gtc gqc ctg ccc acc gca cac cgc gaa gac gec gqc agg ttc 3360 

Ala Glu val Gly Leu Pro Thr Ala His Arg Glu Asp Ala Gly Arg Phe 

1105 1110 1115 1120 

gqa ate cac cct get ctg ctg gat gcg gca ctg cag gcg ctg gqc gca 3408 

Gly lie His Pro Ala Leu Leu Asp Ala Ala Leu Gin Ala Leu Gly Ala 

1125 1130 1135 

gec gaa gag gat ccg gac gag gga tgg etc ccg ttc gcg tgg caa ggt 3456 

Ala Glu Glu Asp Pro Asp Glu Gly Trp Leu Pro Phe Ala Trp Gin Gly 
1140 1145 1150 

gtg tec etc aaa gcg acg ggc gca ctt tec ctt egg gtg cac etc gtt 3504 

val ser Leu Lys Ala Thr Gly Ala Leu Ser Leu Arg val His Leu Val 
1155 1160 1165 

ccg gcg ggc gcg aat gcg gtg teg gtg ttc acg acc gac acg act ggc 3552 

Pro Ala Gly Ala Asn Ala val Ser Val Phe Thr Thr Asp Thr Thr Gly 
1170 1175 1180 



3600 



aag cag ttg gca gcg gec cgt gcg atg gaa cac gag tec ctg ttc egg 3648 
Lys Gin Leu Ala Ala Ala Arg Ala Met Glu His Glu Ser Leu Phe Arg 
1205 ~ 1210 1215 



3696 



gca gtq ate gqc aat gac gaa etc gee cga gec tgc ggc teg gca ctt 3744 
Ala Val lie Gly Asn Asp Glu Leu Ala Arg Ala Cys Gly Ser Ala Leu 
1235 1240 ^ 1245 
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ggc acg gaa etc cac ccc gac ctg acc ggg ttg get gac ccg ccc ccg 3792 
Gly Thr Glu Leu His Pro Asp Leu Thr Gly Leu Ala Asp Pro Pro Pro 
1250 1255 1260 

gac gtc gtg gtg gtg cca tgc ggt gcg tct cgc cag gac ttg gac gtt 3840 
Asp val val val val Pro Cys Gly Ala Ser Arg Gin Asp Leu Asp val 
1265 1270 1275 1280 

get tec gag gca cgt gec gcg aca caa cgc atg ctt gac ctg ate cag 3888 
Ala Ser Glu Ala Arg Ala Ala Thr Gin Arg Met Leu Asp Leu He Gin 
1285 1290 1295 

gat tgg ttg gcg gcg gcg cga ttc gec gga tct cgc ctg gtq gtt gtq 3936 
Asp Trp Leu Ala Ala Ala Arg Phe Ala Gly Ser Arg Leu val Val Val 
1300 " 1305 1310 

acg tgt gqt gcg gcg teg aca gqt ccc gec gag gqt gtt tec gac ctg 3984 
Thr Cys Gly Ala Ala Ser Thr Gly Pro Ala Glu Gly Val Ser Asp Leu 
1315 1320 1325 

gtq cat get gcg teg tgg gqt ttg ttg cgt teg gcg cag teg gag aac 4032 
Val His Ala Ala Ser Trp Gly Leu Leu Arg Ser Ala Gin Ser Glu Asn 
1330 1335 " 1340 

ccg gac cga ttc gtq ttg gtc gat gtq gac gqa acc gec gaa tea tgg 4080 
Pro Asp Arg Phe val Leu val Asp val Asp Gly Thr Ala Glu Ser Trp 
1345 1350 1355 1360 

cgt gcg etc gcg gcg gee gtq cgt tec gqa gaa ccg cag ctg gcg ttg 4128 
Arg Ala Leu Ala Ala Ala val Arg Ser Gly Glu Pro Gin Leu Ala Leu 
1365 - 1370 1375 

cgc gee gqt gaa gtc egg gtq cct cgc ctg gcg cga tgt gtt gec gec 4176 
Arg Ala Gly Glu val Arg val Pro Arg Leu Ala Arg Cys val Ala Ala 
1380 1385 1390 

gag gac age egg ate cca gtq ccc gqt gcg gat ggg acg gtq ttg att 4224 
Glu Asp Ser Arg lie Pro val Pro Gly Ala Asp Gly Thr val Leu lie 
1395 1400 1405 

tec gqc gqt acg gqc ctg ctg gqc ggg ttg gtt gec egg cat ttg gtg 4272 
Ser Gly Gly Thr Gly Leu Leu Gly Gly Leu Val Ala Arg His Leu val 
1410 1415 1420 

gcg gag cgc ggt gtc cgc cgc ctg gtg etc gcg ggg cga cgc ggc tgg 4320 
Ala Glu Arg Gly val Arg Arg Leu Val Leu Ala Gly Arg Arg Gly Trp 
1425 1430 ~ 1435 ~ 1440 

age gee ccc ggg gtc acc gac ctg gtg gat gag ttg gtg ggc ctg gga 4368 
ser Ala Pro Gly Val Thr Asp Leu Val Asp Glu Leu Val Gly Leu Gly 
1445 1450 1455 

get gcg gtc gag gtg gcg age tgc gat gtc ggg gat egg gec cag ttg 4416 
Ala Ala val Glu val Ala ser cys Asp val Gly Asp Arg Ala Gin Leu 
1460 1465 1470 

gac egg ctg ctg acg acg ate teg gca gag ttc ccg ctg cgc gga gtg 4464 
Asp Arg Leu Leu Thr Thr He Ser Ala Glu Phe Pro Leu Arg Gly Val 
1475 1480 1485 

gtg cat gcg gec ggg gca ctt gec gac ggg gtc gtc gag teg ctg aca 4512 
val His Ala Ala Gly Ala Leu Ala Asp Gly val Val Glu Ser Leu Thr 
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1490 1495 1500 

cca gag cac gtg gca aag gtg ttc ggc ccg aag gcc gcc ggt gcg tgg 4560 
Pro Glu His val Ala Lys Val Phe Gly Pro Lys Ala Ala Gly Ala Trp 
1505 1510 1515 1520 

cac ctg cac gag ttg act ctt gat ctg gat etc teg ttc ttc gtg etc 4608 
His Leu His Glu Leu Thr Leu Asp Leu Asp Leu Ser Phe Phe val Leu 
1525 1530 1535 

ttc tec teg ttc tec ggc gtg gcg ggg get gcg ggt cag gga aac tac 4656 
Phe Ser ser Phe Ser Gly val Ala Gly Ala Ala Gly Gin Gly Asn Tyr 
1540 1545 1550 

gcg gcg gcg aac gcg ttc ctg gac ggc ctg get cag cac egg egg acg 4704 
Ala Ala Ala Asn Ala Phe Leu Asp Gly Leu Ala Gin His Arg Arg Thr 
1555 1560 1565 

gcg ggg ctg cct gcg gtg teg ctg get tgg ggc ttg tgg gag cag ccc 4752 
Ala Gly Leu Pro Ala Val Ser Leu Ala Trp Gly Leu Trp Glu Gin Pro 
1570 1575 1580 



a 9 c 999 atg a cc gga gcg etc gat gcg gcg ggc cgt age cgc att gcg 
Ser Gly Met Thr Gly Ala Leu Asp Ala Ala Gly Arg Ser Arg lie Ala 
1585 1590 1595 ~ 1600 



teg gcg gtg ctg ggg cat teg ggt gcg gtc gag gtc ggg gcc gat cgt 
ser Ala val Leu Gly His Ser Gly Ala val Glu Val Gly Ala Asp Arg 
1700 1705 1710 

get ttc egg gat ttg ggt ttt gat teg ttg tec ggt gtg gag ttt egg 
Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Ser Gly val Glu Phe Arg 
1715 1720 1725 

aac egg ctt ggc ggg gtg ctg ggc gtt egg ttg ccg get act gcg gtg 
Asn Arg Leu Gly Gly val Leu Gly Val Arg Leu Pro Ala Thr Ala val 
1730 1735 ^ 1740 



4800 



cgc ace aat ccg ccg atg tec gcg ccg gac ggg ttg egg ctg ttc gag 4848 
Arg Thr Asn Pro Pro Met Ser Ala Pro Asp Gly Leu Arg Leu Phe Glu 
1605 1610 1615 

atg gcg ttt cgc gtt ccg ggc gaa teg ctt ctg gtt ccg gtc cac gtc 4896 
Met Ala Phe Arg val Pro Gly Glu Ser Leu Leu val Pro val His val 
1620 1625 1630 

gac ctg aac gcc ctg cgc get gat gcg gcc gac ggc ggt gtg cct gcg 
Asp Leu Asn Ala Leu Arg Ala Asp Ala Ala Asp Gly Gly val Pro Ala 
1635 1640 1645 



4944 



ttg ttg cgc gac ctg gtg cca gcg ccc gtg egg egg age gcg gtc aac 4992 

Leu Leu Arg Asp Leu val Pro Ala Pro val Arg Arg Ser Ala val Asn 
1650 1655 1660 

gag teg gcg gac gtc aac ggt ctg gtt ggt egg ctg egg agg ctg ccg 5040 

Glu Ser Ala Asp Val Asn Gly Leu Val Gly Arg Leu Arg Arg Leu Pro 
1665 1670 1675 ~ 1680 

gac ctg gat cag gaa acc cag ctg ttg ggt ttg gtg cgc gag cat gtt 5088 

Asp Leu Asp Gin Glu Thr Gin Leu Leu Gly Leu val Arg Glu His val 
1685 1690 1695 



5136 



5184 



5232 



ttc gac tat ccg aca ccg egg gcg ttg gtt egg ttc ttg etc gac aaa 5280 

Page 213 



LeA33955-US 

Phe Asp Tyr Pro Thr Pro Arg Ala Leu val Arg Phe Leu Leu Asp Lys 
1745 1750 1755 1760 

ctg att gqt gqc gtq gag get ccg act ccc gca ccg gcg get gtq gcg 5328 
Leu lie Gly Gly Val Glu Ala Pro Thr Pro Ala Pro Ala Ala val Ala 
1765 1770 1775 

gcg gtq act get gac gat ccc gtt gtq ate gtq gqg atg gqc tgt cgt 5376 
Ala val Thr Ala Asp Asp Pro Val Val lie val Gly Met Gly Cys Arg 
1780 1785 1790 

tat ccg gqt gqg gtq tec teg ccg gag gag ctt tgg cgt ttg gtq gee 5424 
Tyr Pro Gly Gly Val Ser Ser Pro Glu Glu Leu Trp Arg Leu Val Ala 
1795 1800 1805 

gqg gqc ttg gat gcg gtq gcg gag ttc ccg gac gat cgt gqc tgg gat 5472 
Gly Gly Leu Asp Ala Val Ala Glu Phe Pro Asp Asp Arg Gly Trp Asp 
1810 1815 1820 

cag gcg gqg ttg ttc gat ccg gat ccc gat cgt ctt gqg acc teg tat 5520 
Gin Ala Gly Leu Phe Asp Pro Asp Pro Asp Arg Leu Gly Thr Ser Tyr 
1825 1830 1835 1840 



gtq tgt gag gqt gqc ttc ctg cga gat gcg gca gag ttc gat gec gqt 
val cys Glu Gly Gly Phe Leu Arg Asp Ala Ala Glu Phe Asp Ala Gly 
1845 1850 1855 



egg gtt gcg tat teg ttt gqt ttc gag gqt cct gcg gtq acg gtq gat 
Arg Val Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala val Thr Val Asp 
1940 1945 1950 



5568 



ttc ttc gqg att tec ccg cgt gag gcg ttg gcg atg gat ccg cag cag 5616 

Phe Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin 
1860 1865 1870 

egg ttg ctg ctg gaa gtc get tgg gaa acc gtq gag egg gcg gqg att 5664 

Arg Leu Leu Leu Glu val Ala Trp Glu Thr Val Glu Arg Ala Gly lie 
1875 1880 1885 

gat ccg ctt teg ttg egg gqg age egg acc gqc gtq ttc gcg gqg ctg 5712 

Asp Pro Leu Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu 
1890 1895 ~ 1900 

atg cac cac gac tac gqc gcg egg ttc ate acg agg gcg ccg gag gqt 5760 

Met His His Asp Tyr Gly Ala Arg Phe lie Thr Arg Ala Pro Glu Gly 
1905 1910 1915 1920 

ttc gag gqt tat eta gqt aat gqc age gcg gqa gqc gtq ttt teg gqt 5808 

Phe Glu Gly Tyr Leu Gly Asn Gly Ser Ala Gly Gly Val Phe Ser Gly 
1925 ' ' 1930 ^ 1935 



5856 



acg gcg tgt teg teg teg ttg gtq gcg ctg cac ctg gcg gqt caa gca 5904 
Thr Ala Cys Ser ser ser Leu val Ala Leu His Leu Ala Gly Gin Ala 
1955 1960 1965 

ctg egg tct gqt gag tgt gat ctg get ctt gcg gqt gqt gtq acg gtq 5952 
Leu Arg Ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly val Thr val 
1970 1975 1980 

atg gec acg ccg gqg atg ttc gtq gag ttt teg cgt caa egg gqc ttg 6000 
Met Ala Thr Pro Gly Met Phe val Glu Phe Ser Arg Gin Arg Gly Leu 
1985 1990 1995 ~ "* 2000 

Page 214 



LeA33955-US 

gcg gcg gat gqg egg tgc aag teg ttt gcg gcg get gcg gat gqc acc 6048 

Ala Ala Asp Gly Arg Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr 

2005 2010 2015 

gqt tgg gqa gaa gqc gcg gqc ttg gtq ttg ttg gag egg ctg teg gat 6096 

Gly Trp Gly Glu Gly Ala Gly Leu val Leu Leu Glu Arg Leu Ser Asp 

2020 2025 2030 

gee egg cgc aac gqg cac gcg gtt ctg gcg gtc gtq egg gqt age gcg 6144 

Ala Arg Arg Asn Gly His Ala val Leu Ala Val Val Arg Gly Ser Ala 

2035 2040 2045 

gtq aat cag gat gqt gcg teg aat gqt ttg acg gcg ccg aat gqg ccc 6192 

val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro 
2050 2055 2060 

teg cag cag egg gtq ate acg cag gcg ttg gcg agt get ggt ttg teg 6240 

Ser Gin Gin Arg val lie Thr Gin Ala Leu Ala Ser Ala Gly Leu Ser 
2065 ~ 2070 2075 2080 

gtq tct gat gtq gac gee gtq gag gcg cat gqg act gqa acc agg ctt 6288 

val ser Asp val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu 

2085 2090 2095 

gqt gat ccg att gag gcg cag get ctg att gec act tac gqg cag gqg 6336 

Gly Asp Pro lie Glu Ala Gin Ala Leu lie Ala Thr Tyr Gly Gin Gly 

2100 2105 2110 

egg gat age gat egg ccg ttg tgg ttg gqg teg gtq aag teg aat att 6384 

Arg Asp ser Asp Arg Pro Leu Trp Leu Gly ser Val Lys Ser Asn lie 

2115 2120 2125 

gqt cat acg cag gcg gcg gcg gqt gtc get gqt gtq ate aag atg gtq 6432 

Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val lie Lys Met Val 
2130 2135 2140 

atg gcg atg egg cac gqg cag ctg ccc gcg acg ttg cat gtq gat gaa 6480 

Met Ala Met Arg His Gly Gin Leu Pro Ala Thr Leu His val Asp Glu 
2145 2150 2155 2160 

cct acg teg gaa gtq gat tgg teg gcg gqg gat gtc cag etc etc acg 6528 

Pro Thr ser Glu val Asp Trp Ser Ala Gly Asp Val Gin Leu Leu Thr 

2165 2170 2175 

gag aac acc ccc tgg ccc gqc aac age cat cct egg egg gtq gqc gtq 6576 

Glu Asn Thr Pro Trp Pro Gly Asn ser His Pro Arg Arg val Gly val 

2180 2185 2190 

teg teg ttc gqg ate age gqc acc aac gca cac gtc ate etc gaa caa 6624 

Ser Ser Phe Gly lie Ser Gly Thr Asn Ala His Val lie Leu Glu Gin 

2195 2200 2205 

gee teg aaa aca cca gac gag act gcg gac aag age gqt ccc gat teg 6672 

Ala Ser Lys Thr Pro Asp Glu Thr Ala Asp Lys Ser Gly Pro Asp Ser 
2210 2215 2220 

gaa teg acc gtq gac ctt cca gcg gtc ccg ttg ate gtq teg gqg aga 6720 

Glu Ser Thr Val Asp Leu Pro Ala Val Pro Leu lie Val Ser Gly Arg 
2225 2230 2235 2240 

aca ccg gca gcg etc age get cag gcg age gca ttg ttg tec tat ttg 6768 

Thr Pro Ala Ala Leu Ser Ala Gin Ala Ser Ala Leu Leu Ser Tyr Leu 

2245 2250 2255 
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gqt gag cgt gqc gat att tec acg ctg gat gcg gcg ttt teg ttg get 
Gly Glu Arg Gly Asp lie Ser Thr Leu Asp Ala Ala Phe Ser Leu Ala 
2260 2265 2270 



gcg gcg gtt gcg gcg gqa gtq ttg teg ttg ccg gat gcg get egg atg 
Ala Ala val Ala Ala Gly val Leu Ser Leu Pro Asp Ala Ala Arg Met 
2420 2425 2430 

gtq gcg gqt cgt gee egg ttg atg cag gcg ttg cct tct gqc gqt gee 
val Ala Gly Arg Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Ala 
2435 2440 2445 

atg ttg gcg gtq get get gqt gag gag cag ctg egg ccg ttg ttg gee 
Met Leu Ala val Ala Ala Gly Glu Glu Gin Leu Arg Pro Leu Leu Ala 
2450 2455 2460 

gat egg gtt gat gqt gcg gqt ate gee gcg gtc aac get cct gag teg 
Asp Arg val Asp Gly Ala Gly lie Ala Ala val Asn Ala Pro Glu Ser 
2465 2470 2475 2480 



ctg gat gqg caa gqg att egg tgg egg egg ttg egg gtt teg cat gcg 
Leu Asp Gly Gin Gly lie Arg Trp Arg Arg Leu Arg Val Ser His Ala 
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tec tec egg gee gcg ttg gag gag egg gcg gtq gtq ctg gqa gcg gac 6864 
ser ser Arg Ala Ala Leu Glu Glu Arg Ala Val val Leu Gly Ala Asp 
2275 2280 2285 



cgc gaa acg ttg ttg tec gqg ttg gaa gcg ctg get tec gqt cgc gag 6912 
Arg Glu Thr Leu Leu ser Gly Leu Glu Ala Leu Ala Ser Gly Arg Glu 
2290 2295 2300 

6960 



7008 



get tct gqg gtq gtq teg gqa tec ccg gtc tct gqc gqg gtt gqg ttc 
Ala Ser Gly val val Ser Gly Ser pro val Ser Gly Gly val Gly Phe 
2305 2310 2315 2320 

gtq ttc gee gqt cag gqc gqa cag tgg ttg gqg atg gqc egg gqg etc 
val Phe Ala Gly Gin Gly Gly Gin Trp Leu Gly Met Gly Arg Gly Leu 
2325 2330 2335 

tac teg gtt ttt ccg gtq ttc get gac gcg ttt gac gaa gca tgt gee 7056 
Tyr ser val Phe Pro Val Phe Ala Asp Ala Phe Asp Glu Ala Cys Ala 
2340 2345 2350 

gqa ctg gac gcg cat ctg gqg cag gac gtq gqg gtc egg gat gtq gtq 7104 
Gly Leu Asp Ala His Leu Gly Gin Asp val Gly val Arg Asp val val 
2355 2360 2365 

ttt gqt tec gac gqg tec ttg ttg gat egg acg ctg tgg gee cag teg 7152 
Phe Gly ser Asp Gly Ser Leu Leu Asp Arg Thr Leu Trp Ala Gin Ser 
2370 2375 2380 

gqt ttg ttc gcg ttg cag gtt gqt ttg ctg age ctg ctg gqt teg tgg 7200 
Gly Leu Phe Ala Leu Gin Val Gly Leu Leu Ser Leu Leu Gly Ser Trp 
2385 2390 2395 2400 

gqt gtc egg ccg gqt gtq gtq ctg gqc cat teg gtc gqc gag ttc gcg 7248 
Gly val Arg Pro Gly Val Val Leu Gly His Ser Val Gly Glu Phe Ala 
2405 2410 2415 



7296 



7344 



7392 



7440 



gtq gtq etc tec gqc gat egg gag gtq ctt gac gac ate gec gqc gcg 7488 
val Val Leu ser Gly Asp Arg Glu val Leu Asp Asp lie Ala Gly Ala 
2485 " 2490 2495 



7536 
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2500 2505 2510 

ttt cat teg tat egg atg gac ccg atg ttg cag gag ttc gec gaa ate 7584 

Phe His ser Tyr Arg Met Asp Pro Met Leu Gin Glu Phe Ala Glu lie 
2515 2520 2525 

gca cgc age gtg gac tac egg cgt gqc gac eta ccg gtc gtq teg acg 7632 

Ala Arg ser val Asp Tyr Arg Arg Gly Asp Leu Pro Val val Ser Thr 
2530 2535 2540 

ttg acg ggt gag etc gac ace gca gqt gtq atg get acg ccg gag tat 7680 

Leu Thr Gly Glu Leu Asp Thr Ala Gly Val Met Ala Thr Pro Glu Tyr 

2545 2550 2555 2560 

tgg gtq cgt cag gtt cga gag ccc gtc cgc ttc gec gac gqc gtc egg 7728 

Trp val Arg Gin val Arg Glu Pro Val Arg Phe Ala Asp Gly val Arg 

2565 2570 2575 

gtq etc gcg cag caa gqg gtc gec acg ate ttc gaa etc gqc cct gat 7776 

Val Leu Ala Gin Gin Gly Val Ala Thr lie Phe Glu Leu Gly Pro Asp 
2580 2585 2590 

gcg acg ctg teg gee ctg att ccc gat tgt cat teg tgg get gat cag 7824 

Ala Thr Leu ser Ala Leu lie Pro Asp Cys His ser Trp Ala Asp Gin 
2595 2600 2605 

gee atg ccg att ccg atg ctg cgt aaa gac cgt acg gaa acc gaa act 7872 

Ala Met Pro lie Pro Met Leu Arg Lys Asp Arg Thr Glu Thr Glu Thr 
2610 2615 2620 

gtq gtc gec gcg gtq gcg egg gcg cac acg cgt gqt gtt ccg gtc gaa 7920 

val val Ala Ala Val Ala Arg Ala His Thr Arg Gly val Pro Val Glu 

2625 2630 2635 2640 

tgg teg gcg tat ttc gec gqc acc gqg gca egg egg gtc gag ttg ccg 7968 

Trp ser Ala Tyr Phe Ala Gly Thr Gly Ala Arg Arg Val Glu Leu Pro 

2645 2650 ~ 2655 

acg tat gec ttc cag egg cag egg tac tgg ctg gaa aca teg gat tac 8016 

Thr Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Glu Thr Ser Asp Tyr 
2660 2665 2670 

gqc gat gtq acg gqt ate gqc ctg get gcg gcg gag cat ccg ttg ctg 8064 

Gly Asp val Thr Gly lie Gly Leu Ala Ala Ala Glu His Pro Leu Leu 
2675 2680 2685 

gqg gec gtq gtt gcg ctg gee gat gqt gat gqg atg gtq ctg acc gqc 8112 

Gly Ala Val val Ala Leu Ala Asp Gly Asp Gly Met val Leu Thr Gly 
2690 2695 2700 

egg ttg teg gtq gqg acg cat ccg tgg ctg gee cag cat cgc gtq ctg 8160 

Arg Leu Ser val Gly Thr His Pro Trp Leu Ala Gin His Arg Val Leu 

2705 2710 2715 ~ 2720 

ggc gag gtc gtc gtc ccc gqc acc gec ate ctg gag atg gec ctg cac 8208 

Gly Glu val val val Pro Gly Thr Ala lie Leu Glu Met Ala Leu His 

2725 2730 2735 

gca gqg gcg cgt etc gqc tgt gac egg gtq gaa gag etc acc ctg gaa 8256 

Ala Gly Ala Arg Leu Gly cys Asp Arg val Glu Glu Leu Thr Leu Glu 
2740 2745 2750 

aca ccg ctg gtg gtc ccc gaa cgc gcg gcg ggt gec ggt agt cgt ggc 8304 

Page 217 



LeA33955-US 

Thr Pro Leu val val Pro Glu Arg Ala Ala Gly Ala Gly Ser Arg Gly 
2755 2760 2765 

cct gcg gga ggg acc aca gtt tea att gaa act gcg gaa gaa cgt gtg 8352 

Pro Ala Gly Gly Thr Thr val Ser lie Glu Thr Ala Glu Glu Arg Val 

2770 2775 2780 

egg acg aac gac gec ate gaa ate cag ctg ctg gtq aac gca ccc gac 8400 

Arg Thr Asn Asp Ala lie Glu lie Gin Leu Leu val Asn Ala Pro Asp 
2785 2790 2795 2800 

gaa ggc ggt egg cga agg gtg teg ctg tat tec cgc ccg gee ggt gqg 8448 

Glu Gly Gly Arg Arg Arg Val Ser Leu Tyr Ser Arg Pro Ala Gly Gly 
2805 " 2810 ~ 2815 

teg aga gqt gqg gqt tgg acg cgc cac gec acc ggc gaa etc gtc gtc 8496 

ser Arg Gly Gly Gly Trp Thr Arg His Ala Thr Gly Glu Leu val val 
2820 2825 2830 

ggc acc acc gqt gqt agg gcg gtt cct gat tgg teg get gag gqt gec 8544 

Gly Thr Thr Gly Gly Arg Ala Val Pro Asp Trp Ser Ala Glu Gly Ala 
2835 " 2840 2845 

gag teg att get etc gat gag ttc tac gtc get ctg gee gqa aac gqg 8592 

Glu Ser lie Ala Leu Asp Glu Phe Tyr val Ala Leu Ala Gly Asn Gly 

2850 2855 2860 

ttc gag tac gqg ccg ttg ttc cag gqg ctt cag gcg gca tgg cgt cgt 8640 

Phe Glu Tyr Gly Pro Leu Phe Gin Gly Leu Gin Ala Ala Trp Arg Arg 
2865 2870 2875 2880 

gqt gac gag gtt etc gee gaa ate gec ccg ccg gee gag gee gat gcg 8688 

Gly Asp Glu val Leu Ala Glu lie Ala Pro Pro Ala Glu Ala Asp Ala 
2885 2890 2895 

atg gcg teg gqa tac ctg etc gac cca gcg ttg ctg gat gec gcg ctg 8736 

Met Ala Ser Gly Tyr Leu Leu Asp Pro Ala Leu Leu Asp Ala Ala Leu 
2900 2905 2910 

cag gcg tec gcg etc ggc gac cgc ccg gag caa ggc ggc gcg tgg ctg 8784 

Gin Ala Ser Ala Leu Gly Asp Arg Pro Glu Gin Gly Gly Ala Trp Leu 
2915 2920 2925 

ccg ttc tea ttc acc ggc gtc gaa ctt tec get ccg gca ggg acg ate 8832 

Pro Phe Ser Phe Thr Gly val Glu Leu ser Ala Pro Ala Gly Thr lie 

2930 2935 2940 

age agg gtg egg ctg gag acc agg cga ccc gac gcg ata teg gtg gee 8880 

Ser Arg val Arg Leu Glu Thr Arg Arg Pro Asp Ala lie Ser Val Ala 
2945 2950 2955 2960 

gtg atg gat gag agt ggg egg ttg etc gec teg ate gat tct etc agg 8928 

val Met Asp Glu Ser Gly Arg Leu Leu Ala Ser lie Asp Ser Leu Arg 
2965 2970 2975 

eta cga age gtg teg teg gga cag ctg gcg aat egg gac get gtc cgc 8976 

Leu Arg Ser val Ser Ser Gly Gin Leu Ala Asn Arg Asp Ala Val Arg 
2980 2985 2990 

gac gcg ctg ttc gag gtg acc tgg gag ccg gtg gcg acg cag teg acg 9024 

Asp Ala Leu Phe Glu Val Thr Trp Glu Pro val Ala Thr Gin Ser Thr 
2995 3000 3005 
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gaa ccg gqt cgc tgg gcc ctg ctt gqt gat act gcc tgc ggt aaa gac 9072 

Glu Pro Gly Arg Trp Ala Leu Leu Gly Asp Thr Ala Cys Gly Lys Asp 
3010 3015 3020 

gat etc ate aaa etc gca acg gat tec gcc gac cgc tgc gcg gat ctg 9120 

Asp Leu lie Lys Leu Ala Thr Asp Ser Ala Asp Arg Cys Ala Asp Leu 
3025 3030 3035 3040 

gcg gcg eta gcc gag aaa ctt gat tec age gcg ctg gtt cct gat gtc 9168 

Ala Ala Leu Ala Glu Lys Leu Asp Ser Ser Ala Leu Val Pro Asp val 
3045 3050 3055 

gtq gtc tac tgc gcc gqa gaa cag gcg gat ccc gqc ace gqc gca gcc 9216 

val val Tyr cys Ala Gly Glu Gin Ala Asp Pro Gly Thr Gly Ala Ala 
3060 3065 3070 

gca ctt gcg gag acc cag cag acg ttg get ctg etc caa gcg tgg ttg 9264 

Ala Leu Ala Glu Thr Gin Gin Thr Leu Ala Leu Leu Gin Ala Trp Leu 
3075 3080 3085 

get gag ccg egg ttg gcc gag gca cgt ctg gtq gtq gtq acg tgt gca 9312 

Ala Glu Pro Arg Leu Ala Glu Ala Arg Leu Val Val Val Thr Cys Ala 
3090 3095 3100 

gcg gtq acg acg get ccg agt gac gqt gca tea gag ctg gca cat gcg 9360 

Ala val Thr Thr Ala Pro Ser Asp Gly Ala Ser Glu Leu Ala His Ala 
3105 3110 3115 3120 

ccg ttg tgg gqg ttg ttg cgt gcc gcg cag gtq gag aac ccg gqg cag 9408 

Pro Leu Trp Gly Leu Leu Arg Ala Ala Gin val Glu Asn Pro Gly Gin 
3125 ~ 3130 3135 

ttt gtq ctg gcg gac gtc gac gqa acc gcc gaa teg tgg cgt gcg ttg 9456 

Phe val Leu Ala Asp val Asp Gly Thr Ala Glu Ser Trp Arg Ala Leu 
3140 3145 3150 

ccg agt gcg ttg gqc teg atg gaa ccg cag ttg gcc ctg egg aag gqc 9504 

Pro Ser Ala Leu Gly ser Met Glu Pro Gin Leu Ala Leu Arg Lys Gly 
3155 3160 3165 

gcg gtq cga gcg ccc cgc ttg get teg gtc gcc gqg cag ate gac gtq 9552 

Ala val Arg Ala Pro Arg Leu Ala Ser val Ala Gly Gin lie Asp Val 
3170 ~ 3175 3180 

ccc gcg gtt gtq gcg gat ccc gac cga acc gtq ctg att teg gqc gqc 9600 

Pro Ala val val Ala Asp Pro Asp Arg Thr val Leu lie Ser Gly Gly 
3185 3190 " 3195 3200 

acg gqc ctg ttg gqg gqc gcg gtt gcc cgc cac ctg gtq acc gaa cgc 9648 

Thr Gly Leu Leu Gly Gly Ala val Ala Arg His Leu val Thr Glu Arg 
3205 3210 3215 

gqt gtc cgc cga ttg gtq ttg acg gqc cgt cgt gqc tgg gat get cct 9696 

Gly val Arg Arg Leu val Leu Thr Gly Arg Arg Gly Trp Asp Ala Pro 
3220 3225 3230 

gqa ate acc gag ttg gtg gqt gag ctg aac gqc etc gqt gcc gtq gtc 9744 

Gly lie Thr Glu Leu Val Gly Glu Leu Asn Gly Leu Gly Ala val val 
3235 3240 3245 

gac gtq gtq gcg tgc gac gtc gcg gat cgt get gat ctg gag teg ttg 9792 

Asp Val Val Ala Cys Asp Val Ala Asp Arg Ala Asp Leu Glu Ser Leu 
3250 3255 3260 
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ctg gcg gcg gtc ccg gcg gaa ttt ccg ttg tgc ggc gtg gtg cat gcc 9840 

Leu Ala Ala val Pro Ala Glu Phe Pro Leu Cys Gly val val His Ala 
3265 3270 3275 3280 

gcg ggg gcg ctg gcc gac ggg gtg ate gag teg ttg tea ccg gac gac 9888 

Ala Gly Ala Leu Ala Asp Gly Val lie Glu Ser Leu Ser Pro Asp Asp 

3285 3290 3295 

gtg gga gcg gtg ttc ggc ccg aag gcg gcg ggg gcg tgg aat ctg cac 9936 

val Gly Ala val Phe Gly Pro Lys Ala Ala Gly Ala Trp Asn Leu His 

3300 3305 3310 

gag ctg act cgt gat acg gac ctg teg ttc ttc gcg ttg ttc tec teg 9984 

Glu Leu Thr Arg Asp Thr Asp Leu Ser Phe Phe Ala Leu Phe ser ser 
3315 3320 3325 

ctt tec ggt gtt gcc ggc get cct ggt cag ggc aat tat gcg gcg gcg 10032 

Leu Ser Gly val Ala Gly Ala Pro Gly Gin Gly Asn Tyr Ala Ala Ala 
3330 3335 3340 

aac gcg ttc ctg gac gca ttg gcg cat tac egg egg tea cag gga ctg 10080 

Asn Ala Phe Leu Asp Ala Leu Ala His Tyr Arg Arg ser Gin Gly Leu 
3345 3350 3355 " 3360 

cct gcg gtg teg ctg gcc tgg ggc ctg tgg gag cag ccg age ggg atg 10128 

Pro Ala val Ser Leu Ala Trp Gly Leu Trp Glu Gin Pro ser Gly Met 

3365 3370 3375 

acg gag acg etc age gag gtc gac egg age agg ate gcg cgc gcc aac 10176 

Thr Glu Thr Leu ser Glu Val Asp Arg Ser Arg lie Ala Arg Ala Asn 

3380 3385 3390 

ccg ccg ttg tec ace aag gag gga ttg egg ctg ttc gat gcc ggg ctg 10224 

Pro Pro Leu Ser Thr Lys Glu Gly Leu Arg Leu Phe Asp Ala Gly Leu 
3395 3400 3405 

gcg ctg gac egg gca gcg gta gtt ccg gcg aag ttg gac agg act ttc 10272 

Ala Leu Asp Arg Ala Ala Val Val Pro Ala Lys Leu Asp Arg Thr Phe 
3410 ~ 3415 3420 

ctg gcc gag cag gcg egg teg ggc teg ctg ccc gca ttg ttg acg gca 10320 

Leu Ala Glu Gin Ala Arg ser Gly ser Leu Pro Ala Leu Leu Thr Ala 
3425 3430 3435 3440 

ctg gta ccc ccc ate cgt cgt aat agg egg get age gga acc gag etc 10368 

Leu Val Pro Pro lie Arg Arg Asn Arg Arg Ala Ser Gly Thr Glu Leu 

3445 3450 3455 

gcg gac gag ggc acc ctg etc ggg gtg gtg egg gag cat gcc gcg gcc 10416 

Ala Asp Glu Gly Thr Leu Leu Gly val val Arg Glu His Ala Ala Ala 

3460 3465 3470 

gtg ctg ggg tat teg age gcg get gac gtc ggg gtc gag cgc get ttc 10464 

val Leu Gly Tyr ser ser Ala Ala Asp val Gly Val Glu Arg Ala Phe 
3475 3480 3485 

egg gat ctg ggt ttt gat teg ttg tct ggt gtg gag ttg egg aac cgc 10512 

Arg Asp Leu Gly Phe Asp ser Leu Ser Gly Val Glu Leu Arg Asn Arg 
3490 3495 3500 

ctt gcc ggg gtg ctg ggg gtg egg ttg ccg gcg act gcg gtg ttc gac 10560 

Leu Ala Gly Val Leu Gly val Arg Leu Pro Ala Thr Ala Val Phe Asp 
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3505 3510 3515 3520 

tat ccg acg ccg agg gcg ctg gcc egg ttc ctg cac cag gaa ctg gca 10608 
Tyr Pro Thr Pro Arg Ala Leu Ala Arg Phe Leu His Gin Glu Leu Ala 
3525 3530 3535 

gac gag ate get acg acg cca gcg ccg gtq acg acg acc agg gca ccg 10656 
Asp Glu He Ala Thr Thr Pro Ala Pro Val Thr Thr Thr Arg Ala Pro 
3540 3545 3550 

gtc gcc gaa gac gat etc gtc gcg ata gtc ggg atg gqa tgc cgt ttt 10704 
val Ala Glu Asp Asp Leu val Ala lie Val Gly Met Gly Cys Arg Phe 
3555 3560 3565 



ccc ggt cag gtq tec teg ccg gag gag etc tgg cgt ttg gtq gcc gqg 
Pro Gly Gin val ser Ser Pro Glu Glu Leu Trp Arg Leu Val Ala Gly 
3570 3575 3580 



ccg gqt teg ctg agg gqg agt egg acc gqt gtq ttc gcg gqg ctg atg 

Pro Gly Ser Leu Arg Gly Ser Arg Thr Gly val Phe Ala Gly Leu Met 

3665 3670 3675 3680 

tac cac gac tat ggc gcc egg ttc gcc age cga gcc ccg gaa gqt ttc 

Tyr His Asp Tyr Gly Ala Arg Phe Ala Ser Arg Ala Pro Glu Gly Phe 

3685 3690 3695 



10752 



ggc gtq gat gcg gtc gcg gac ttc cca gcc gat cgc ggc tgg gat ctg 10800 
Gly val Asp Ala Val Ala Asp Phe Pro Ala Asp Arg Gly Trp Asp Leu 
3585 3590 3595 3600 

gca ggc ttg ttc gat ccg gac ccg gaa egg get ggg aag acc tac gtg 10848 
Ala Gly Leu Phe Asp Pro Asp Pro Glu Arg Ala Gly Lys Thr Tyr Val 
3605 3610 3615 

egg gaa ggg gcc ttc etc acc gac gcc gat egg ttc gat gcg gqt ttc 10896 
Arg Glu Gly Ala Phe Leu Thr Asp Ala Asp Arg Phe Asp Ala Gly Phe 
3620 3625 3630 

ttc ggg att tec ccg cgt gag gcg ttg gcg atg gat ccg cag caa egg 10944 
Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
3635 ~ 3640 3645 

ctg ttg ctg gag ctg tec tgg gag gcc att gaa egg gca gqg ate gat 10992 
Leu Leu Leu Glu Leu ser Trp Glu Ala lie Glu Arg Ala Gly lie Asp 
3650 3655 3660 



11040 



11088 



gag ggg tat etc ggc aat ggc agt get ggg agt gtc gcg teg ggc egg 11136 

Glu Gly Tyr Leu Gly Asn Gly Ser Ala Gly Ser val Ala Ser Gly Arg 
3700 3705 3710 

att gcg tac teg ttt ggt ttc gag ggt cct gcg gtq acg gtq gat act 

lie Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala val Thr val Asp Thr 
3715 3720 3725 

gcg tgt teg teg teg ttg gtg gcg ttg cat ttg gcg ggt cag teg ttg 

Ala Cys ser Ser Ser Leu Val Ala Leu His Leu Ala Gly Gin Ser Leu 
3730 3735 3740 

cgt tec ggc gaa tgc gat etc gcc ctt gcc ggt ggt gtq acg gtg atg 

Arg ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr val Met 
3745 3750 3755 3760 



11184 



11232 



11280 



teg acg ccc ggg acg ttt gtg gaa ttc tec cgt cag egg ggc ctg gca 11328 
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Ser Thr Pro Gly Thr Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala 
3765 3770 " 3775 

ccg gac ggg egg tgc aag teg ttc gcg gag age gcg gac ggt acc ggt 11376 
Pro Asp Gly Arg Cys Lys Ser Phe Ala Glu Ser Ala Asp Gly Thr Gly 
3780 3785 3790 

tgg ggt gag gqt get ggt ttg gtg ttg ttg gag egg ttg teg gat get 11424 
Trp Gly Glu Gly Ala Gly Leu val Leu Leu Glu Arg Leu ser Asp Ala 
3795 3800 3805 

egg egg aat ggg cat egg gtg ttg gcg gtg gtt cgt ggg teg gcg gtg 11472 
Arg Arg Asn Gly His Arg Val Leu Ala Val val Arg Gly ser Ala Val 
3810 3815 3820 

aat cag gat ggt gcg teg aat ggc ttg acc gcg ccg aat ggt ccc teg 11520 
Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser 
3825 3830 3835 3840 

cag cag egg gtc ate cag cag gcg ttg gcg agt gcg ggt ctg teg gtg 11568 
Gin Gin Arg val lie Gin Gin Ala Leu Ala Ser Ala Gly Leu Ser Val 
3845 3850 3855 

tec gat gtg gat gee gtg gag gcg cat ggg acc ggg acc agg ttg ggt 11616 
Ser Asp val Asp Ala val Glu Ala His Gly Thr Gly Thr Arg Leu Gly 
3860 3865 3870 

gat ccg att gag gcg cag get ctg att get acg tat ggg cgc gat cgt 11664 
Asp Pro lie Glu Ala Gin Ala Leu lie Ala Thr Tyr Gly Arg Asp Arg 
3875 3880 3885 

gat ccc ggt egg ccg ttg tgg ttg ggg teg gtg aag tec aac ate ggt 11712 
Asp Pro Gly Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn lie Gly 
3890 3895 3900 

cat acg cag gcg gcg gcg ggt gtt gee ggt gtg ate aag atg gtg atg 11760 
His Thr Gin Ala Ala Ala Gly val Ala Gly Val He Lys Met Val Met 
3905 3910 3915 3920 

gcg atg egg cac ggg caa ctt ccg cgc acg ctg cac gtg gat gca ccc 11808 
Ala Met Arg His Gly Gin Leu Pro Arg Thr Leu His val Asp Ala Pro 
3925 3930 3935 

tec teg cag gtg gat tgg teg gcg ggg agg gtc cag etc ctg acg gag 11856 
Ser Ser Gin Val Asp Trp ser Ala Gly Arg Val Gin Leu Leu Thr Glu 
3940 3945 3950 

aac acg ccc tgg ccc gac agt ggt cgc ccc tgt egg gtg ggg gtg teg 11904 
Asn Thr Pro Trp Pro Asp Ser Gly Arg Pro Cys Arg val Gly Val Ser 
3955 3960 3965 

teg ttc ggg ate age ggc acc aac gcg cac gtc ate ctg gaa cag tec 11952 
Ser Phe Gly lie Ser Gly Thr Asn Ala His val lie Leu Glu Gin Ser 
3970 3975 3980 

acg ggg cag atg gat cag gca gcg gag ccg gat teg agt cct gtt ctg 12000 
Thr Gly Gin Met Asp Gin Ala Ala Glu Pro Asp Ser Ser Pro val Leu 
3985 3990 3995 4000 

gat gtt ccg gtg gtg ccg tgg gtg gtg teg ggc aaa aca ccc gaa gcg 12048 
Asp val Pro Val val Pro Trp val val ser Gly Lys Thr Pro Glu Ala 
4005 4010 4015 
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eta tec gec cag gcg gca acg ttg gcg acc tat ttg gac caa aat gtt 12096 
Leu Ser Ala Gin Ala Ala Thr Leu Ala Thr Tyr Leu Asp Gin Asn val 
4020 4025 4030 



gat gtc tec cct ctg gac gtt gqg att teg ctt gcg gtq acc cgt teg 
Asp val Ser pro Leu Asp val Gly He Ser Leu Ala val Thr Arg Ser 
4035 4040 4045 



gtt acg gqa cct gtc gqg att gqt gqc egg acc gqg ttt gtq ttc gee 
val Thr Gly Pro val Gly lie Gly Gly Arg Thr Gly Phe Val Phe Ala 
4085 4090 4095 



ttt ccg gcg ttc gee gqt get ttc gac gaa gca tgc gee gag etc gat 

Phe Pro Ala Phe Ala Gly Ala Phe Asp Glu Ala Cys Ala Glu Leu Asp 
4115 4120 4125 

gcg aac ctg gqg agg gaa gtc gqg gtt egg gat gtq gtq ttc gqc tec 

Ala Asn Leu Gly Arg Glu val Gly Val Arg Asp val Val Phe Gly Ser 
4130 " 4135 4140 

gac gag tec ttg ctg gat egg act ttg tgg gcg cag teg gqt ttg ttc 

Asp Glu Ser Leu Leu Asp Arg Thr Leu Trp Ala Gin Ser Gly Leu Phe 
4145 4150 4155 4160 

gcg ttg cag gtc gqt etc tgg gaa ttg ttg gqt acg tgg gqt gtt egg 

Ala Leu Gin Val Gly Leu Trp Glu Leu Leu Gly Thr Trp Gly Val Arg 
4165 4170 4175 

ccc age gta gtq ctg gqg cat teg gtc gqg gag eta gee gcg gcg ttc 

Pro Ser val Val Leu Gly His Ser val Gly Glu Leu Ala Ala Ala Phe 
4180 4185 4190 

gec gca gqt gtq ctg teg atg gcg gag gcg get egg ctg gtq gcg gqt 

Ala Ala Gly val Leu Ser Met Ala Glu Ala Ala Arg Leu val Ala Gly 
4195 4200 4205 

cgt gcg egg ttg atg cag gcg ttg cct tct gqc gqt gee atg ctg gcg 

Arg Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Ala Met Leu Ala 
4210 4215 4220 

gtq tec gcg acc gag gee cga gtc gqc ccg ctg etc gat gqg gtq egg 

val Ser Ala Thr Glu Ala Arg val Gly Pro Leu Leu Asp Gly val Arg 
4225 4230 4235 4240 



12144 



gcg ctg gat gag egg gcg gtq gtq ctg gqg teg gat cgt gac acg ttg 12192 
Ala Leu Asp Glu Arg Ala val val Leu Gly Ser Asp Arg Asp Thr Leu 
4050 4055 4060 

ttg tct gqc ctg aat gcg ctg get gec gqt cat gag get get gqc gtq 12240 
Leu Ser Gly Leu Asn Ala Leu Ala Ala Gly His Glu Ala Ala Gly Val 
4065 4070 4075 4080 



12288 



gqt caa gqc gqt cag tgg ttg gqg atg gqc cgc egg ttg tac teg gag 12336 
Gly Gin Gly Gly Gin Trp Leu Gly Met Gly Arg Arg Leu Tyr Ser Glu 
4100 4105 4110 



12384 



12432 



12480 



12528 



12576 



12624 



12672 



12720 



gat cgt gtt gqt gtc gca gcg gtt aac get ccg gqg teg gtq gtq ctt 12768 
Asp Arg val Gly Val Ala Ala val Asn Ala Pro Gly Ser val val Leu 
4245 4250 4255 

tec gqt gac egg gat gtq etc gat gqc att gee gqt egg ctg gac gqg 12816 

Ser Gly Asp Arg Asp vaT Leu Asp Gly lie Ala Gly Arg Leu Asp Gly 
4260 4265 4270 
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caa gqt ate egg teg agg tgg ttg egg gtt teg cac gcg ttt cat teg 12864 

Gin Gly lie Arg Ser Arg Trp Leu Arg val Ser His Ala Phe His Ser 
4275 " 4280 4285 

cat egg atg gat ccg atg ctg gcg gag ttc gec gag etc gca egg age 12912 

His Arg Met Asp Pro Met Leu Ala Glu Phe Ala Glu Leu Ala Arg Ser 
4290 4295 4300 

gtq gac tac egg tct cca egg ctg ccg att gtc teg acg ctg acc gqa 12960 

Val Asp Tyr Arg ser Pro Arg Leu Pro lie val Ser Thr Leu Thr Gly 

4305 4310 4315 4320 

aac etc gat gac gtq gqc gtq atg get acg ccg gag tat tgg gtq cgc 13008 

Asn Leu Asp Asp val Gly Val Met Ala Thr Pro Glu Tyr Trp val Arg 

4325 4330 4335 

cag gtq cga gag ccc gtc cgc ttc gec gac gqt gtc cag gcg ctt gtq 13056 

Gin val Arg Glu Pro val Arg Phe Ala Asp Gly Val Gin Ala Leu val 
4340 ^ 4345 4350 

gac caa gqc gtc gac acg att gtq gaa etc gqt ccg gac gqg gcg ttg 13104 

Asp Gin Gly val Asp Thr lie val Glu Leu Gly Pro Asp Gly Ala Leu 
4355 4360 4365 

teg age ttg gtt caa gag tgt gtq gcg gag tec gqg egg gcg acg gqg 13152 

Ser Ser Leu val Gin Glu Cys Val Ala Glu Ser Gly Arg Ala Thr Gly 
4370 4375 4380 

att ccg ttg gtq egg aga gac cgt gat gag gtc cga acg gtq ctg gac 13200 

lie Pro Leu Val Arg Arg Asp Arg Asp Glu Val Arg Thr Val Leu Asp 

4385 4390 4395 4400 

get ttg gcg cag acc cac act cgt gqt gqc gcg gtq gac tgg gqg tea 13248 

Ala Leu Ala Gin Thr His Thr Arg Gly Gly Ala Val Asp Trp Gly Ser 

4405 4410 4415 

ttt ttc get gqt acg agg gca acg caa gtc gac ctt ccc acg tat gec 13296 

Phe Phe Ala Gly Thr Arg Ala Thr Gin Val Asp Leu Pro Thr Tyr Ala 
4420 4425 4430 

ttc caa cga cag egg tac tgg ctg gag cca teg gat tec gqt gat gtq 13344 

Phe Gin Arg Gin Arg Tyr Trp Leu Glu Pro Ser Asp Ser Gly Asp Val 
4435 4440 4445 

acc gqt gtt gqc ctg acc gqg gcg gag cat ccg ctg ttg gqt gec gtq 13392 

Thr Gly Val Gly Leu Thr Gly Ala Glu His Pro Leu Leu Gly Ala Val 
4450 4455 4460 

gtq ccg gtc gcg gqc gqc gat gag gtq ctg ctg acc gqc agg ctg teg 13440 

val Pro val Ala Gly Gly Asp Glu val Leu Leu Thr Gly Arg Leu Ser 

4465 4470 4475 ~ 4480 

gtq gqg acg cat ccg tgg ctg gcg gaa cac cgc gtq ctg gqc gaa gtc 13488 

val Gly Thr His Pro Trp Leu Ala Glu His Arg Val Leu Gly Glu val 

4485 4490 ~ 4495 

gtc gtc ccc gqc acc gcg ttg ctg gag atg gcg tgg egg gec gqt age 13536 

val val Pro Gly Thr Ala Leu Leu Glu Met Ala Trp Arg Ala Gly Ser 
4500 4505 4510 

cag gtc gqt tgt gaa cgt gtq gag gag etc acc ttg gag gca ccg ctg 13584 

Gin Val Gly Cys Glu Arg val Glu Glu Leu Thr Leu Glu Ala Pro Leu 
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4515 4520 4525 



gtc ctg ccg gag egg gqc get gcg gcg gtq cag ttg gcg gtq gqg get 13632 
val Leu Pro Glu Arg Gly Ala Ala Ala Val Gin Leu Ala Val Gly Ala 
4530 4535 4540 



ccg gat gag gee gqc egg cgc agt ttg cag etc tat tec cga gqc get 

Pro Asp Glu Ala Gly Arg Arg Ser Leu Gin Leu Tyr Ser Arg Gly Ala 

4545 4550 " 4555 " 4560 

gat gaa gac gqc gac tgg egg egg att gee tec gqg ctg ttg gee cag 

Asp Glu Asp Gly Asp Trp Arg Arg lie Ala Ser Gly Leu Leu Ala Gin 

4565 4570 4575 

gee aat gcg gtq ccg ccg gcg gat teg acg gca tgg ccg ccg gac gqc 

Ala Asn Ala val Pro Pro Ala Asp Ser Thr Ala Trp Pro Pro Asp Gly 

4580 4585 4590 

gec gqg cag gtc gat ctg gcg gag ttc tac gag cgc etc gec gag cgc 

Ala Gly Gin Val Asp Leu Ala Glu Phe Tyr Glu Arg Leu Ala Glu Arg 
4595 4600 4605 



cgt gqc gec cag ctg tac cgc get gaa gqa gca gcg ctt egg gta egg 

Arg Gly Ala Gin Leu Tyr Arg Ala Glu Gly Ala Ala Leu Arg Val Arg 
4675 4680 4685 

etc teg ccg ctg gqc tec gqt gca gtc tea ttg acg ttg gtq gat gee 

Leu ser Pro Leu Gly Ser Gly Ala val ser Leu Thr Leu val Asp Ala 
4690 4695 4700 

aca gqg cga cga gtc get gcg gtq gaa teg ctt teg acg cga ccg gtc 

Thr Gly Arg Arg val Ala Ala Val Glu Ser Leu Ser Thr Arg Pro val 

4705 " 4710 4715 ~ 4720 

tec ace gac cag ate gqt gec gqt cgc gqc gat caa gag egg ctg ctg 

Ser Thr Asp Gin lie Gly Ala Gly Arg Gly Asp Gin Glu Arg Leu Leu 

4725 4730 4735 



13680 



13728 



13776 



13824 



gqc ttg ace tac gqt ccg gta ttc caa gqg etc cgc gee gca tgg egg 13872 

Gly Leu Thr Tyr Gly Pro Val Phe Gin Gly Leu Arg Ala Ala Trp Arg 
4610 4615 4620 

cac gqc gac gat ate ttc gee gaa ttg gee gqg tea cca gac gee teg 13920 

His Gly Asp Asp lie Phe Ala Glu Leu Ala Gly Ser Pro Asp Ala Ser 
4625 4630 4635 4640 

gqt ttc gqc ate cac ccg gcg ctg ctg gac get gca ctg cac gcg atg 13968 

Gly Phe Gly lie His Pro Ala Leu Leu Asp Ala Ala Leu His Ala Met 
4645 4650 4655 

gcg ctt gqt get teg ccc gac teg gaa gcg cgt ctg ccg ttt tec tgg 14016 

Ala Leu Gly Ala ser Pro Asp Ser Glu Ala Arg Leu Pro Phe Ser Trp 
4660 4665 4670 



14064 



14112 



14160 



14208 



cac gtc gag tgg gta agg teg get gaa tct gcg gqg atg tct ctg acc 14256 

His val Glu Trp val Arg Ser Ala Glu Ser Ala Gly Met Ser Leu Thr 
4740 4745 4750 

tec tgc gcg gtq gtc gqt ttg gqc gaa ccg gag tgg cac get gcg ctg 14304 

Ser cys Ala Val val Gly Leu Gly Glu Pro Glu Trp His Ala Ala Leu 
4755 4760 4765 

aag acc act ggt gtc caa gtc gag tec cat gcg gac ctt get teg ttg 14352 
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Lys Thr Thr Gly val Gin val Glu Ser His Ala Asp Leu Ala Ser Leu 
4770 4775 4780 

gcc acc gag gtt gcc aag egg ggt tea get cct ggt gcg gtc ate gtc 14400 

Ala Thr Glu val Ala Lys Arg Gly Ser Ala Pro Gly Ala val He val 
4785 4790 4795 4800 

ccg tgc ccg cga ccc cga gcg atg cag gag ctg ccg acc gcc gcg cga 14448 

Pro Cys Pro Arg Pro Arg Ala Met Gin Glu Leu Pro Thr Ala Ala Arg 

4805 4810 4815 

agg gcg acg caa cag gcg atg gcg atg ctg cag caa tgg ctt gcc gat 14496 

Arg Ala Thr Gin Gin Ala Met Ala Met Leu Gin Gin Trp Leu Ala Asp 

4820 4825 4830 

gac egg ttc gtc agt acg cgc ctg ate ctg ctg acg cat egg gcg gtc 14544 

Asp Arg Phe val Ser Thr Arg Leu lie Leu Leu Thr His Arg Ala val 
4835 4840 4845 

tec gca gtt get gga gaa gac gtg etc gac ctg gta cac gcg ccg ctg 14592 

Ser Ala Val Ala Gly Glu Asp val Leu Asp Leu Val His Ala Pro Leu 
4850 4855 4860 

tgg ggc ttg gtc cgc age gcg caa gcg gag cac ccg gac cga ttc gcc 14640 

Trp Gly Leu Val Arg ser Ala Gin Ala Glu His Pro Asp Arg Phe Ala 

4865 4870 4875 ~ 4880 

ttg ate gat atg gac gac gag cga gca teg cag acg gca etc gcc gaa 14688 

Leu lie Asp Met Asp Asp Glu Arg Ala Ser Gin Thr Ala Leu Ala Glu 

4885 4890 4895 

gcg ctg act gcg gqa gaa gcg cag etc gcg gtq egg teg gqa gtt gtq 14736 

Ala Leu Thr Ala Gly Glu Ala Gin Leu Ala Val Arg Ser Gly val val 

4900 4905 4910 

ctg gcg ccc cgc etc ggc cag gtq aag gtq agt gqa gqt gaa gcg ttc 14784 

Leu Ala Pro Arg Leu Gly Gin Val Lys Val Ser Gly Gly Glu Ala Phe 
4915 4920 4925 

agg tgg gat gaa ggc acc gtg ctg gtc acc ggc gga acc ggc ggg etc 14832 

Arg Trp Asp Glu Gly Thr val Leu Val Thr Gly Gly Thr Gly Gly Leu 
4930 4935 4940 

ggg gcc ctg etc gca cgc cat ctg gtc age gcc cac ggt gtg egg cac 14880 

Gly Ala Leu Leu Ala Arg His Leu Val Ser Ala His Gly val Arg His 
4945 4950 4955 4960 

ctg ttg etc gca agt cgc cgt ggt ctg gcg gcg ccc gga gcg gat gag 14928 

Leu Leu Leu Ala Ser Arg Arg Gly Leu Ala Ala Pro Gly Ala Asp Glu 

4965 4970 4975 

ctg gtq gcc gag ctg gag cag gcc ggc gcc gac gtc gcg gtc gtc gcg 14976 

Leu VaT Ala Glu Leu Glu Gin Ala Gly Ala Asp Val Ala val val Ala 

4980 4985 4990 

tgc gac teg gca gat egg gac teg ctt gcg egg ctg gtg gcg teg gtg 15024 

Cys Asp Ser Ala Asp Arg Asp Ser Leu Ala Arg Leu Val Ala Ser Val 
4995 ~ 5000 5005 

cct gcg gaa aac ccg ttg egg gtg gtg gtg cac gcc gcc ggt gtg ctg 15072 

Pro Ala Glu Asn Pro Leu Arg Val Val Val His Ala Ala Gly Val Leu 
5010 5015 5020 
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gat gac gqt gtq ctg atg teg atg teg ccg gag cgc ttg gac gcg gtq 15120 
Asp Asp Gly val Leu Met Ser Met Ser Pro Glu Arg Leu Asp Ala val 
5025 5030 5035 5040 

ttg egg ccc aaa gtq gat gec gcg tgg tac ctg cac gag ctg act egg 15168 
Leu Arg Pro Lys Val Asp Ala Ala Trp Tyr Leu His Glu Leu Thr Arg 
5045 5050 5055 

gaa etc gqt ctg teg gcg ttc gtq ttg ttc tec teg gtc gcg gqc ctg 15216 
Glu Leu Gly Leu Ser Ala Phe val Leu Phe Ser Ser Val Ala Gly Leu 
5060 5065 5070 

ttc gqc gqt gcg gqg cag age aat tac get gee gqc aac get ttc ctg 15264 
Phe Gly Gly Ala Gly Gin Ser Asn Tyr Ala Ala Gly Asn Ala Phe Leu 
5075 5080 5085 

gat gee ttg gcg cat tgc egg cag gee cag gqg ctg ccc gcg ctg teg 15312 
Asp Ala Leu Ala His Cys Arg Gin Ala Gin Gly Leu Pro Ala Leu Ser 
5090 5095 5100 

ctg gec tec gqg ctg tgg gcg agt ate gat gqa atg gcg gqc gac etc 15360 
Leu Ala Ser Gly Leu Trp Ala Ser lie Asp Gly Met Ala Gly Asp Leu 
5105 5110 5115 * 5120 

get gcg gca gat gtq gag egg ctg teg egg gca gqc att gqc ccg ctt 15408 
Ala Ala Ala Asp val Glu Arg Leu Ser Arg Ala Gly lie Gly Pro Leu 
5125 ' 5130 5135 

teg gca ccg gqa gqg ctg gec ttg ttc gac get gec gtt gqc teg gac 15456 
Ser Ala Pro Gly Gly Leu Ala Leu Phe Asp Ala Ala val Gly ser Asp 
5140 5145 5150 

gaa ccg ttg ctg gca ccg gtq cga ctg gat gtc gaa gca ctg cgt gtq 15504 
Glu Pro Leu Leu Ala Pro val Arg Leu Asp val Glu Ala Leu Arg val 
5155 5160 5165 

cag gec cga tec gtq cag ace egg att ccg gaa atg ctg cat gqc atg 15552 
Gin Ala Arg Ser val Gin Thr Arg lie Pro Glu Met Leu His Gly Met 
5170 ~ 5175 5180 

gca atg gqg cca age cgc cgc act ccg ttc act tec agg gtt gag ccg 15600 
Ala Met Gly Pro Ser Arg Arg Thr Pro Phe Thr Ser Arg Val Glu Pro 
5185 5190 5195 " 5200 

ttg cac gaa egg ctg gee gqa ttg teg gag gqc gaa cgt egg cag caa 15648 
Leu His Glu Arg Leu Ala Gly Leu Ser Glu Gly Glu Arg Arg Gin Gin 
5205 * 5210 5215 

gtq etc cag cgc gtc cgc gee gat ate gcg gtq gta ctg gqg cac gqc 15696 
val Leu Gin Arg val Arg Ala Asp lie Ala Val Val Leu Gly His Gly 
5220 5225 5230 

agg teg age gat gtq gac ate gag aag cct ttg gee gag ctg gqt ttc 15744 
Arg ser Ser Asp val Asp lie Glu Lys Pro Leu Ala Glu Leu Gly Phe 
5235 5240 5245 

gac teg ctg acg gec ate gaa etc cgc aac cgt etc get acc gec acc 15792 
Asp ser Leu Thr Ala lie Glu Leu Arg Asn Arg Leu Ala Thr Ala Thr 
5250 5255 5260 

gqa ctg egg ctt ccc gcg acg ctg gec ttc gac cac gqc act gcg gcg 15840 
Gly Leu Arg Leu Pro Ala Thr Leu Ala Phe Asp His Gly Thr Ala Ala 
5265 ~ 5270 5275 5280 
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gca etc gec cag cac gtg tgc gcg cag eta ggc acc gcg acc gcg ccg 15888 

Ala Leu Ala Gin His Val Cys Ala Gin Leu Gly Thr Ala Thr Ala Pro 
5285 5290 5295 

gca ccg agg cga acc gac gac aac gac gec acg gag ccc gtq agg teg 15936 

Ala Pro Arg Arg Thr Asp Asp Asn Asp Ala Thr Glu Pro Val Arg Ser 
5300 5305 5310 

etc ttc caa cag gcg tat gcg get gqc egg ata ctt gac ggg atg gat 15984 

Leu Phe Gin Gin Ala Tyr Ala Ala Gly Arg lie Leu Asp Gly Met Asp 
5315 5320 5325 

ttg gtq aag gtc get gec cag ttg cga ccg gtq ttc ggt teg cct gqc 16032 

Leu val Lys val Ala Ala Gin Leu Arg Pro val Phe Gly Ser Pro Gly 
5330 5335 5340 

gag ctg gaa tec ctg ccg aaa ccc gtc cag ctt tec cgt gqt ccc gaa 16080 

Glu Leu Glu Ser Leu Pro Lys Pro val Gin Leu Ser Arg Gly Pro Glu 
5345 5350 5355 5360 

gag ctt gec ttg gtq tgc atg ccg gcg ctg ate ggg atg ccg ccc gca 16128 

Glu Leu Ala Leu val Cys Met Pro Ala Leu lie Gly Met Pro Pro Ala 
5365 5370 5375 

cag cag tac gcg egg ate gec gec ggg ttc cgc gat gtg egg gac gtt 16176 

Gin Gin Tyr Ala Arg lie Ala Ala Gly Phe Arg Asp val Arg Asp val 
5380 5385 5390 

teg gtg ate ccg atg cct gga ttc att gcg gga gaa ccg ctg ccg tec 16224 

Ser val lie Pro Met Pro Gly Phe lie Ala Gly Glu Pro Leu Pro Ser 
5395 5400 5405 

gee ate gag gtq gcg gtt egg acg cag gcg gag gcg gtq ctg cag gaa 16272 

Ala lie Glu val Ala val Arg Thr Gin Ala Glu Ala Val Leu Gin Glu 
5410 5415 5420 

ttc gec ggg ggc teg ttc gta ctg gtc ggg cat tec tec ggg ggc tgg 16320 

Phe Ala Gly Gly Ser Phe Val Leu Val Gly His Ser Ser Gly Gly Trp 
5425 5430 5435 5440 

ctg gcg cac gag gta gee gqt gag ctg gag cgt cgc ggg gtc gtc ccg 16368 

Leu Ala His Glu val Ala Gly Glu Leu Glu Arg Arg Gly Val Val Pro 
5445 5450 ~ ' 5455 

gec ggg gtc gta ctg ctg gac acc tac ate ccc ggt gag ate acg ccg 16416 

Ala Gly val val Leu Leu Asp Thr Tyr lie Pro Gly Glu lie Thr Pro 
5460 5465 5470 

agg ttc tec gtg gcg atg gee cac egg acg tat gag aag etc gcg act 16464 

Arg Phe Ser val Ala Met Ala His Arg Thr Tyr Glu Lys Leu Ala Thr 
5475 5480 " 5485 

ttc acg gac atg cag gat gtc ggt ate acc gcg atg ggc ggg tac ttc 16512 

Phe Thr Asp Met Gin Asp val Gly lie Thr Ala Met Gly Gly Tyr Phe 
5490 5495 5500 

egg atg ttc acc gag tgg act ccg acg ccg ate ggt get ccg acg ctg 16560 

Arg Met Phe Thr Glu Trp Thr Pro Thr Pro lie Gly Ala Pro Thr Leu 
5505 5510 5515 5520 

ttc gtg egg acc gaa gat tgc gtc gca gac cct gaa ggg egg ccg tgg 16608 

Phe val Arg Thr Glu Asp Cys Val Ala Asp Pro Glu Gly Arg Pro Trp 

Page 228 



LeA33955-US 

5525 5530 5535 

aca gat gac tec tgg egg cca gqg tgg act etc gcg gat gec acg gtc 16656 
Thr Asp Asp Ser Trp Arg Pro Gly Trp Thr Leu Ala Asp Ala Thr Val 
5540 5545 5550 

cag gtq ccg gqc gac cac ttc teg atg atg gac gag cac gec gqg tec 16704 
Gin val Pro Gly Asp His Phe Ser Met Met Asp Glu His Ala Gly Ser 
5555 5560 5565 

ace gca cag gca gtc gcg agt tgg ctt gac aaa etc aac cag cgc ace 16752 
Thr Ala Gin Ala Val Ala Ser Trp Leu Asp Lys Leu Asn Gin Arg Thr 
5570 5575 5580 

get egg caa cgc tga 16767 

Ala Arg Gin Arg 

5585 

<210> 50 
<211> 5588 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 50 

Met Ala Asn Glu Glu Lys Leu Arg Glu Tyr Leu Lys Arg val val val 
1 5 10 15 

Glu Leu Glu Glu Ala His Glu Arg Leu His Glu Leu Glu Arg Gin Glu 
20 25 30 

His Asp Pro lie Ala lie Val Ser Met Gly Cys Arg Tyr Pro Gly Gly 
35 40 45 

Val Ser Thr Pro Glu Glu Leu Trp Arg Leu Val Val Asp Gly Gly Asp 
50 55 60 

Ala lie Ala Asn Phe Pro Glu Asp Arg Gly Trp Asn Leu Asp Glu Leu 
65 70 "* 75 80 

Phe Asp Pro Asp Pro Gly Arg Ala Gly Thr Ser Tyr Val Arg Glu Gly 
85 90 95 

Gly Phe Leu Arg Gly Val Ala Asp Phe Asp Ala Gly Leu Phe Gly lie 
100 105 110 

Ser Pro Arg Glu Ala Gin Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 

Glu lie Ser Trp Glu Val Phe Glu Arg Ala Gly lie Asp Pro Phe Ser 
130 135 140 

Leu Arg Gly Thr Lys Thr Gly val Phe Ala Gly Leu lie Tyr His Asp 
145 ~ 150 ~ 155 160 

Tyr Ala Ser Arg Phe Arg Lys Thr Pro Ala Glu Phe Glu Gly Tyr Phe 
165 ' 170 175 

Ala Thr Gly Asn Ala Gly Ser Val Ala Ser Gly Arg val Ala Tyr Thr 
180 185 " 190 

Phe Gly Leu Glu Gly Pro Ala val Thr val Asp Thr Ala Cys Ser Ser 
195 200 205 
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Ser Leu val Ala Leu His Leu Ala Cys Gin Ser Leu Arg Leu Gly Glu 
210 215 220 

Cys Asp Leu Ala Leu Ala Gly Gly lie Ser Val Met Ala Thr Pro Gly 
225 230 235 240 

Ala Phe val Glu Phe Ser Arg Gin Arg Ala Leu Ala Ser Asp Gly Arg 
245 ~ 250 255 

cys Lys Pro Phe Ala Asp Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly 
260 265 270 

Ala Gly Met Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly 
275 280 285 

His Pro val Leu Ala Ala val val Gly Ser Ala lie Asn Gin Asp Gly 
290 295 300 

Thr Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg val 
305 310 315 320 

lie Arg Gin Ala Leu Ala Asn Ala Gly Leu Ser Pro Ala Glu Val Asp 
325 330 335 

val val Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro lie Glu 
340 345 350 

Ala Gin Ala Leu lie Ala Thr Tyr Gly Ala Asn Arg Ser Ala Asp His 
355 360 365 

Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn lie Gly His Thr Gin Ala 
370 375 380 

Ala Ala Gly val Ala Gly Val lie Lys Ser val Leu Ala lie Arg His 
385 390 395 400 

Arg Glu Met Pro Arg Ser Leu His lie Asp Gin Pro Ser Gin His val 
405 410 415 

Asp Trp Ser Ala Gly Ala val Arg Leu Leu Thr Asp Ser Val Asp Trp 
420 425 430 

Pro Asp Leu Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Met 
435 440 445 

Ser Gly Thr Asn Ala His Leu lie Val Glu Glu val Ser Asp Glu Pro 
450 455 460 

Val Ser Gly Ser Thr Glu Pro Thr Gly Ala Phe Pro Trp Pro Leu Ser 
465 470 475 480 

Gly Lys Thr Glu Thr Ala Leu Arg Glu Gin Ala Ala Glu Leu Leu Ser 
485 490 495 

Val val Thr Glu His Pro Glu Pro Gly Leu Gly Asp val Gly Tyr Ser 
500 505 510 

Leu Ala Thr Gly Arg Ala Ala Met Glu His Arg Ala Val Val Val Ala 
515 ~ 520 ~ 525 

Asp Asp Arg Asp Ser Phe val Ala Gly Leu Thr Ala Leu Ala Ala Gly 
530 535 540 
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val Pro Ala Ala Asn Val val Gin Gly Ala Ala Asp Cys Lys Gly Lys 
545 550 555 560 

Val Ala Phe val Phe Pro Gly Gin Gly Ser His Trp Gin Gly Met Ala 
565 570 575 

Arg Glu Leu Ser Glu Ser ser Pro Val Phe Arg Arg Lys Leu Ala Glu 
580 585 590 

Cys Ala Ala Ala Thr Ala Pro Tyr val Asp Trp ser Leu Leu Gly val 
595 600 605 

Leu Arg Gly Asp Pro Asp Ala Pro Ala Leu Asp Arg Asp Asp val lie 
610 615 620 

Gin Leu Ala Leu Phe Ala Met Met val Ser Leu Ala Glu Leu Trp Arg 
625 630 635 640 

Ser Cys Gly Val Glu Pro Ala Ala Val Val Gly His Ser Gin Gly Glu 
645 650 " 655 

lie Ala Ala Ala His val Ala Gly Ala Leu Ser Leu Thr Asp Ala val 
660 665 670 

Arg lie lie Ala Ala Arg Cys Asp Ala val Ser Ala Leu Thr Gly Lys 
675 680 685 

Gly Gly Met Leu Ala lie Ala Leu Pro Glu Ser Ala Val val Lys Arg 
690 695 700 

lie Ala Gly Leu Pro Glu Leu Thr val Ala Ala Val Asn Gly Pro Gly 
705 710 715 ' 720 

Ser Thr val Val Ser Gly Glu Pro Ser Ala Leu Glu Arg Leu Gin Thr 
725 730 735 

Glu Leu Thr Ala Glu Asn val Gin Thr Arg Arg val Gly lie Asp Tyr 
740 745 " 750 

Ala Ser His ser Pro Gin lie Ala Gin val Gin Gly Arg Leu Leu Asp 
755 760 765 

Arg Leu Gly Glu Val Gly Ser Glu Pro Ala Glu lie Ala Phe Tyr Ser 
770 775 780 

Thr val Thr Gly Glu Arg Thr Asp Thr Gly Arg Leu Asp Ala Asp Tyr 
785 790 795 800 

Trp Tyr Gin Asn Leu Arg Gin Pro Val Arg Phe Gin Gin Thr Val Ala 
805 810 815 

Arg Met Ala Asp Gin Gly Tyr Arg Phe Phe val Glu val Ser Pro His 
820 825 830 

Pro Leu Leu Thr Ala Gly lie Gin Glu Thr Leu Glu Ala Ala Asp Ala 
835 840 845 

Gly Gly val val val Gly ser Leu Arg Arg Gly Glu Gly Gly Ser Arg 
850 855 ~ " 860 

Arg Trp Leu Thr Ser Leu Ala Glu Cys Gin Val Arg Gly Leu Pro val 
865 870 875 ~ 880 
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Asn Trp Glu Gin Val Phe Leu Asn Thr Gly Ala Arg Arg val Pro Leu 
885 890 " 895 

pro Thr Tyr Pro Phe Gin Arg Gin Arg Tyr Trp Leu Glu ser Ala Glu 
900 905 910 

Tyr Asp Ala Gly Asp Leu Gly Ser Val Gly Leu Leu Ser Ala Glu His 
915 920 925 

Pro Leu Leu Gly Ala Ala val Thr Leu Ala Asp Ala Gly Gly Phe Leu 
930 935 940 

Leu Thr Gly Lys Leu Ser Val Lys Thr Gin Pro Trp Leu Ala Asp His 
945 950 955 960 

val val Gly Gly Ala lie Leu Leu Pro Gly Thr Ala Phe val Glu Met 
965 970 975 

Leu lie Arg Ala Ala Asp Gin val Gly Cys Asp Leu lie Glu Glu Leu 
980 985 990 

Ser Leu Thr Thr Pro Leu val Leu Pro Ala Thr Gly Ala Val Gin Val 
995 1000 1005 

Gin lie Ala val Gly Gly Pro Asp Glu Ala Gly Arg Arg Ser val Arg 
1010 1015 1020 

val His Ser Cys Arg Asp Asp Ala Val Pro Gin Asp Ser Trp Thr Cys 
025 1030 1035 1040 

His Ala Thr Gly Thr Leu Thr Ser Ser Asp His Gin Asp Ala Gly Gin 
1045 1050 1055 

Gly Pro Asp Gly lie Trp Pro Pro Asn Asp Ala val Ala val Pro Leu 
1060 1065 1070 

Asp Ser Phe Tyr Ala Arg Ala Ala Glu Arg Gly Phe Asp Phe Gly Pro 
1075 ~ 1080 " 1085 

Ala Phe Gin Gly Leu Gin Ala Ala Trp Lys Arg Gly Asp Glu lie Phe 
1090 1095 1100 

Ala Glu Val Gly Leu Pro Thr Ala His Arg Glu Asp Ala Gly Arg Phe 
105 1110 1115 1120 

Gly lie His Pro Ala Leu Leu Asp Ala Ala Leu Gin Ala Leu Gly Ala 
1125 1130 1135 

Ala Glu Glu Asp Pro Asp Glu Gly Trp Leu Pro Phe Ala Trp Gin Gly 
1140 1145 1150 

Val Ser Leu Lys Ala Thr Gly Ala Leu Ser Leu Arg val His Leu val 
1155 1160 1165 

Pro Ala Gly Ala Asn Ala Val ser val Phe Thr Thr Asp Thr Thr Gly 
1170 1175 1180 

Gin Ala val Leu Ser lie Asp Ser Leu Val Leu Arg Gin lie Ser Asp 
185 1190 1195 " 1200 

Lys Gin Leu Ala Ala Ala Arg Ala Met Glu His Glu Ser Leu Phe Arg 
1205 1210 1215 
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val Asp Trp Lys Arg lie Ser Pro Gly Ala Ala Lys Pro val Ser Trp 
1220 1225 1230 

Ala Val He Gly Asn Asp Glu Leu Ala Arg Ala Cys Gly Ser Ala Leu 
1235 1240 ~ 1245 

Gly Thr Glu Leu His Pro Asp Leu Thr Gly Leu Ala Asp Pro Pro Pro 
1250 1255 1260 

Asp val val val val Pro Cys Gly Ala Ser Arg Gin Asp Leu Asp val 
265 1270 1275 1280 

Ala Ser Glu Ala Arg Ala Ala Thr Gin Arg Met Leu Asp Leu lie Gin 
1285 1290 1295 

Asp Trp Leu Ala Ala Ala Arg Phe Ala Gly Ser Arg Leu val val val 
1300 1305 1310 

Thr Cys Gly Ala Ala Ser Thr Gly Pro Ala Glu Gly Val Ser Asp Leu 
1315 1320 1325 

Val His Ala Ala Ser Trp Gly Leu Leu Arg ser Ala Gin Ser Glu Asn 
1330 1335 1340 

Pro Asp Arg Phe Val Leu Val Asp Val Asp Gly Thr Ala Glu Ser Trp 
345 ~ 1350 1355 1360 

Arg Ala Leu Ala Ala Ala val Arg ser Gly Glu Pro Gin Leu Ala Leu 
1365 ~ 1370 1375 

Arg Ala Gly Glu val Arg Val Pro Arg Leu Ala Arg Cys Val Ala Ala 
1380 ~ 1385 1390 

Glu Asp Ser Arg lie Pro Val Pro Gly Ala Asp Gly Thr Val Leu lie 
1395 1400 1405 

Ser Gly Gly Thr Gly Leu Leu Gly Gly Leu Val Ala Arg His Leu val 
1410 1415 1420 

Ala Glu Arg Gly val Arg Arg Leu Val Leu Ala Gly Arg Arg Gly Trp 
425 1430 1435 1440 

Ser Ala Pro Gly val Thr Asp Leu Val Asp Glu Leu val Gly Leu Gly 
1445 1450 1455 

Ala Ala Val Glu Val Ala Ser Cys Asp val Gly Asp Arg Ala Gin Leu 
1460 1465 * 1470 

Asp Arg Leu Leu Thr Thr lie Ser Ala Glu Phe Pro Leu Arg Gly val 
1475 1480 1485 

val His Ala Ala Gly Ala Leu Ala Asp Gly val val Glu Ser Leu Thr 
1490 1495 1500 

Pro Glu His val Ala Lys Val Phe Gly Pro Lys Ala Ala Gly Ala Trp 
505 1510 1515 1520 

His Leu His Glu Leu Thr Leu Asp Leu Asp Leu Ser Phe Phe Val Leu 
1525 1530 1535 

Phe Ser Ser Phe ser Gly Val Ala Gly Ala Ala Gly Gin Gly Asn Tyr 
1540 * 1545 1550 
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Ala Ala Ala Asn Ala Phe Leu Asp Gly Leu Ala Gin His Arg Arg Thr 
1555 1560 1565 

Ala Gly Leu Pro Ala val Ser Leu Ala Trp Gly Leu Trp Glu Gin Pro 
1570 1575 1580 

Ser Gly Met Thr Gly Ala Leu Asp Ala Ala Gly Arg Ser Arg lie Ala 
585 1590 1595 ~ 1600 

Arg Thr Asn Pro Pro Met Ser Ala Pro Asp Gly Leu Arg Leu Phe Glu 
1605 1610 ~ 1615 

Met Ala Phe Arg val Pro Gly Glu Ser Leu Leu val Pro val His val 
1620 1625 1630 

Asp Leu Asn Ala Leu Arg Ala Asp Ala Ala Asp Gly Gly val Pro Ala 
1635 " 1640 1645 

Leu Leu Arg Asp Leu val Pro Ala Pro val Arg Arg Ser Ala val Asn 
1650 1655 1660 

Glu Ser Ala Asp Val Asn Gly Leu Val Gly Arg Leu Arg Arg Leu Pro 
665 1670 1675 ~ ~ 1680 

Asp Leu Asp Gin Glu Thr Gin Leu Leu Gly Leu val Arg Glu His Val 
1685 1690 ~ 1695 

Ser Ala Val Leu Gly His Ser Gly Ala val Glu val Gly Ala Asp Arg 
1700 1705 1710 

Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Ser Gly Val Glu Phe Arg 
1715 1720 1725 

Asn Arg Leu Gly Gly Val Leu Gly Val Arg Leu Pro Ala Thr Ala val 
1730 1735 1740 

Phe Asp Tyr Pro Thr Pro Arg Ala Leu val Arg Phe Leu Leu Asp Lys 
745 1750 1755 1760 

Leu lie Gly Gly Val Glu Ala Pro Thr pro Ala Pro Ala Ala val Ala 
1765 1770 1775 

Ala val Thr Ala Asp Asp Pro Val Val lie val Gly Met Gly Cys Arg 
1780 1785 1790 

Tyr Pro Gly Gly val Ser Ser Pro Glu Glu Leu Trp Arg Leu Val Ala 
1795 1800 1805 

Gly Gly Leu Asp Ala val Ala Glu Phe Pro Asp Asp Arg Gly Trp Asp 
1810 1815 1820 

Gin Ala Gly Leu Phe Asp Pro Asp Pro Asp Arg Leu Gly Thr Ser Tyr 
825 1830 1835 1840 

val Cys Glu Gly Gly Phe Leu Arg Asp Ala Ala Glu Phe Asp Ala Gly 
1845 1850 1855 

Phe Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin 
1860 1865 1870 

Arg Leu Leu Leu Glu Val Ala Trp Glu Thr val Glu Arg Ala Gly lie 
1875 1880 1885 

Page 234 



LeA33955-US 

Asp Pro Leu Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu 
1890 1895 " 1900 

Met His His Asp Tyr Gly Ala Arg Phe lie Thr Arg Ala Pro Glu Gly 
905 1910 1915 1920 

Phe Glu Gly Tyr Leu Gly Asn Gly Ser Ala Gly Gly val Phe Ser Gly 
1925 1930 1935 

Arg Val Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala val Thr val Asp 
1940 1945 1950 

Thr Ala cys Ser Ser Ser Leu val Ala Leu His Leu Ala Gly Gin Ala 
1955 1960 1965 

Leu Arg ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly val Thr Val 
1970 1975 1980 

Met Ala Thr Pro Gly Met Phe val Glu Phe Ser Arg Gin Arg Gly Leu 
985 1990 1995 2000 

Ala Ala Asp Gly Arg Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr 
2005 2010 2015 

Gly Trp Gly Glu Gly Ala Gly Leu val Leu Leu Glu Arg Leu Ser Asp 
2020 ' 2025 2030 

Ala Arg Arg Asn Gly His Ala Val Leu Ala val Val Arg Gly Ser Ala 
2035 2040 2045 

Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro 
2050 2055 2060 

Ser Gin Gin Arg val lie Thr Gin Ala Leu Ala Ser Ala Gly Leu Ser 
065 " 2070 2075 2080 

val Ser Asp val Asp Ala val Glu Ala His Gly Thr Gly Thr Arg Leu 
2085 2090 2095 

Gly Asp Pro lie Glu Ala Gin Ala Leu lie Ala Thr Tyr Gly Gin Gly 
2100 2105 2110 

Arg Asp Ser Asp Arg Pro Leu Trp Leu Gly Ser val Lys Ser Asn lie 
2115 2120 2125 

Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val lie Lys Met Val 
2130 2135 2140 

Met Ala Met Arg His Gly Gin Leu Pro Ala Thr Leu His Val Asp Glu 
145 2150 2155 2160 

Pro Thr Ser Glu val Asp Trp Ser Ala Gly Asp val Gin Leu Leu Thr 
2165 2170 2175 

Glu Asn Thr Pro Trp Pro Gly Asn Ser His Pro Arg Arg val Gly Val 
2180 2185 2190 

Ser ser Phe Gly lie Ser Gly Thr Asn Ala His val lie Leu Glu Gin 
2195 2200 2205 

Ala Ser Lys Thr Pro Asp Glu Thr Ala Asp Lys Ser Gly Pro Asp Ser 
2210 2215 2220 
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Glu Ser Thr val Asp Leu Pro Ala val Pro Leu lie Val Ser Gly Arg 
225 2230 2235 2240 

Thr Pro Ala Ala Leu Ser Ala Gin Ala Ser Ala Leu Leu Ser Tyr Leu 
2245 2250 2255 

Gly Glu Arg Gly Asp lie Ser Thr Leu Asp Ala Ala Phe Ser Leu Ala 
2260 2265 2270 

Ser Ser Arg Ala Ala Leu Glu Glu Arg Ala val Val Leu Gly Ala Asp 
2275 2280 2285 

Arg Glu Thr Leu Leu Ser Gly Leu Glu Ala Leu Ala Ser Gly Arg Glu 
2290 2295 2300 

Ala Ser Gly val val Ser Gly Ser Pro val ser Gly Gly val Gly Phe 
305 2310 2315 2320 

Val Phe Ala Gly Gin Gly Gly Gin Trp Leu Gly Met Gly Arg Gly Leu 
2325 2330 2335 

Tyr Ser val Phe Pro Val Phe Ala Asp Ala Phe Asp Glu Ala Cys Ala 
2340 2345 2350 

Gly Leu Asp Ala His Leu Gly Gin Asp val Gly Val Arg Asp Val val 
2355 2360 2365 

Phe Gly ser Asp Gly Ser Leu Leu Asp Arg Thr Leu Trp Ala Gin Ser 
2370 2375 2380 

Gly Leu Phe Ala Leu Gin val Gly Leu Leu Ser Leu Leu Gly Ser Trp 
385 2390 2395 2400 

Gly val Arg Pro Gly val val Leu Gly His Ser val Gly Glu Phe Ala 
2405 2410 2415 

Ala Ala val Ala Ala Gly val Leu Ser Leu Pro Asp Ala Ala Arg Met 
2420 2425 2430 

val Ala Gly Arg Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Ala 
2435 ~ 2440 2445 

Met Leu Ala Val Ala Ala Gly Glu Glu Gin Leu Arg Pro Leu Leu Ala 
2450 2455 2460 

Asp Arg Val Asp Gly Ala Gly lie Ala Ala Val Asn Ala Pro Glu Ser 
465 2470 2475 2480 

val val Leu Ser Gly Asp Arg Glu val Leu Asp Asp lie Ala Gly Ala 
2485 2490 2495 

Leu Asp Gly Gin Gly lie Arg Trp Arg Arg Leu Arg val Ser His Ala 
2500 2505 2510 

Phe His Ser Tyr Arg Met Asp Pro Met Leu Gin Glu Phe Ala Glu lie 
2515 2520 2525 

Ala Arg Ser val Asp Tyr Arg Arg Gly Asp Leu Pro val Val Ser Thr 
2530 2535 ' 2540 

Leu Thr Gly Glu Leu Asp Thr Ala Gly val Met Ala Thr Pro Glu Tyr 
545 2550 2555 2560 
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Trp val Arg Gin val Arg Glu Pro val Arg Phe Ala Asp Gly val Arg 
2565 2570 2575 

val Leu Ala Gin Gin Gly val Ala Thr lie Phe Glu Leu Gly Pro Asp 
2580 2585 2590 

Ala Thr Leu ser Ala Leu He Pro Asp Cys His Ser Trp Ala Asp Gin 
2595 2600 2605 

Ala Met Pro lie Pro Met Leu Arg Lys Asp Arg Thr Glu Thr Glu Thr 
2610 2615 2620 

val val Ala Ala val Ala Arg Ala His Thr Arg Gly val Pro Val Glu 
625 2630 2635 2640 

Trp ser Ala Tyr Phe Ala Gly Thr Gly Ala Arg Arg val Glu Leu Pro 
2645 2650 " 2655 

Thr Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Glu Thr Ser Asp Tyr 
2660 2665 2670 

Gly Asp val Thr Gly lie Gly Leu Ala Ala Ala Glu His Pro Leu Leu 
2675 2680 2685 

Gly Ala val val Ala Leu Ala Asp Gly Asp Gly Met val Leu Thr Gly 
2690 2695 2700 

Arg Leu ser val Gly Thr His Pro Trp Leu Ala Gin His Arg val Leu 
705 2710 2715 2720 

Gly Glu val val val Pro Gly Thr Ala lie Leu Glu Met Ala Leu His 
2725 2730 2735 

Ala Gly Ala Arg Leu Gly Cys Asp Arg val Glu Glu Leu Thr Leu Glu 
2740 2745 2750 

Thr Pro Leu Val Val Pro Glu Arg Ala Ala Gly Ala Gly Ser Arg Gly 
2755 2760 ' 2765 

Pro Ala Gly Gly Thr Thr val Ser lie Glu Thr Ala Glu Glu Arg val 
2770 ' 2775 2780 

Arg Thr Asn Asp Ala lie Glu lie Gin Leu Leu Val Asn Ala Pro Asp 
785 2790 2795 2800 

Glu Gly Gly Arg Arg Arg val Ser Leu Tyr ser Arg Pro Ala Gly Gly 
2805 2810 2815 

Ser Arg Gly Gly Gly Trp Thr Arg His Ala Thr Gly Glu Leu Val val 
2820 2825 2830 

Gly Thr Thr Gly Gly Arg Ala val Pro Asp Trp Ser Ala Glu Gly Ala 
2835 ~ 2840 2845 

Glu ser lie Ala Leu Asp Glu Phe Tyr val Ala Leu Ala Gly Asn Gly 
2850 2855 2860 

Phe Glu Tyr Gly Pro Leu Phe Gin Gly Leu Gin Ala Ala Trp Arg Arg 
865 2870 2875 2880 

Gly Asp Glu Val Leu Ala Glu lie Ala Pro Pro Ala Glu Ala Asp Ala 
2885 2890 2895 
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Met Ala Ser Gly Tyr Leu Leu Asp Pro Ala Leu Leu Asp Ala Ala Leu 
2900 2905 2910 

Gin Ala Ser Ala Leu Gly Asp Arg Pro Glu Gin Gly Gly Ala Trp Leu 
2915 2920 2925 

Pro Phe ser Phe Thr Gly Val Glu Leu Ser Ala Pro Ala Gly Thr lie 
2930 2935 2940 

Ser Arg val Arg Leu Glu Thr Arg Arg Pro Asp Ala lie Ser val Ala 
945 " 2950 " 2955 2960 

val Met Asp Glu ser Gly Arg Leu Leu Ala Ser lie Asp Ser Leu Arg 
2965 2970 2975 

Leu Arg ser val Ser Ser Gly Gin Leu Ala Asn Arg Asp Ala Val Arg 
2980 ' 2985 2990 

Asp Ala Leu Phe Glu val Thr Trp Glu Pro val Ala Thr Gin ser Thr 
2995 3000 3005 

Glu Pro Gly Arg Trp Ala Leu Leu Gly Asp Thr Ala Cys Gly Lys Asp 
3010 3015 3020 

Asp Leu lie Lys Leu Ala Thr Asp Ser Ala Asp Arg cys Ala Asp Leu 
025 3030 3035 3040 

Ala Ala Leu Ala Glu Lys Leu Asp Ser Ser Ala Leu Val Pro Asp Val 
3045 3050 3055 

val val Tyr cys Ala Gly Glu Gin Ala Asp Pro Gly Thr Gly Ala Ala 
3060 3065 3070 

Ala Leu Ala Glu Thr Gin Gin Thr Leu Ala Leu Leu Gin Ala Trp Leu 
3075 3080 3085 

Ala Glu Pro Arg Leu Ala Glu Ala Arg Leu val val Val Thr cys Ala 
3090 ~ 3095 3100 

Ala val Thr Thr Ala Pro Ser Asp Gly Ala Ser Glu Leu Ala His Ala 
105 3110 3115 3120 

Pro Leu Trp Gly Leu Leu Arg Ala Ala Gin Val Glu Asn Pro Gly Gin 
3125 " 3130 3135 

Phe Val Leu Ala Asp val Asp Gly Thr Ala Glu Ser Trp Arg Ala Leu 
3140 3145 3150 

Pro Ser Ala Leu Gly Ser Met Glu Pro Gin Leu Ala Leu Arg Lys Gly 
3155 3160 3165 

Ala val Arg Ala Pro Arg Leu Ala Ser val Ala Gly Gin lie Asp val 
3170 3175 3180 

Pro Ala val val Ala Asp Pro Asp Arg Thr val Leu lie ser Gly Gly 
185 3190 ~ 3195 3200 

Thr Gly Leu Leu Gly Gly Ala val Ala Arg His Leu Val Thr Glu Arg 
3205 3210 3215 

Gly Val Arg Arg Leu val Leu Thr Gly Arg Arg Gly Trp Asp Ala Pro 
3220 3225 3230 
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Gly lie Thr Glu Leu Val Gly Glu Leu Asn Gly Leu Gly Ala Val val 
3235 3240 3245 

Asp val val Ala cys Asp val Ala Asp Arg Ala Asp Leu Glu Ser Leu 
3250 3255 " 3260 

Leu Ala Ala val Pro Ala Glu Phe Pro Leu Cys Gly val Val His Ala 
265 3270 3275 3280 

Ala Gly Ala Leu Ala Asp Gly val lie Glu Ser Leu Ser pro Asp Asp 
3285 3290 3295 

val Gly Ala val Phe Gly Pro Lys Ala Ala Gly Ala Trp Asn Leu His 
3300 3305 3310 

Glu Leu Thr Arg Asp Thr Asp Leu Ser Phe Phe Ala Leu Phe Ser ser 
3315 3320 3325 

Leu Ser Gly val Ala Gly Ala Pro Gly Gin Gly Asn Tyr Ala Ala Ala 
3330 3335 3340 

Asn Ala Phe Leu Asp Ala Leu Ala His Tyr Arg Arg ser Gin Gly Leu 
345 3350 3355 " 3360 

Pro Ala val Ser Leu Ala Trp Gly Leu Trp Glu Gin Pro Ser Gly Met 
3365 3370 3375 

Thr Glu Thr Leu Ser Glu Val Asp Arg Ser Arg lie Ala Arg Ala Asn 
3380 3385 3390 

Pro Pro Leu Ser Thr Lys Glu Gly Leu Arg Leu Phe Asp Ala Gly Leu 
3395 3400 3405 

Ala Leu Asp Arg Ala Ala Val Val Pro Ala Lys Leu Asp Arg Thr Phe 
3410 3415 3420 

Leu Ala Glu Gin Ala Arg ser Gly Ser Leu Pro Ala Leu Leu Thr Ala 
425 3430 3435 3440 

Leu val Pro Pro lie Arg Arg Asn Arg Arg Ala Ser Gly Thr Glu Leu 
3445 " 3450 3455 

Ala Asp Glu Gly Thr Leu Leu Gly Val Val Arg Glu His Ala Ala Ala 
3460 3465 3470 

Val Leu Gly Tyr Ser Ser Ala Ala Asp Val Gly val Glu Arg Ala Phe 
3475 3480 3485 

Arg Asp Leu Gly Phe Asp Ser Leu Ser Gly val Glu Leu Arg Asn Arg 
3490 3495 3500 

Leu Ala Gly val Leu Gly val Arg Leu Pro Ala Thr Ala val Phe Asp 
505 3510 3515 3520 

Tyr Pro Thr pro Arg Ala Leu Ala Arg Phe Leu His Gin Glu Leu Ala 
3525 3530 3535 

Asp Glu lie Ala Thr Thr Pro Ala Pro Val Thr Thr Thr Arg Ala Pro 
3540 3545 3550 

Val Ala Glu Asp Asp Leu val Ala lie val Gly Met Gly Cys Arg Phe 
3555 3560 3565 
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Pro Gly Gin val Ser ser Pro Glu Glu Leu Trp Arg Leu val Ala Gly 
3570 3575 3580 

Gly val Asp Ala val Ala Asp Phe Pro Ala Asp Arg Gly Trp Asp Leu 
585 3590 3595 3600 

Ala Gly Leu Phe Asp Pro Asp Pro Glu Arg Ala Gly Lys Thr Tyr Val 
3605 3610 3615 

Arg Glu Gly Ala Phe Leu Thr Asp Ala Asp Arg Phe Asp Ala Gly Phe 
3620 3625 3630 

Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
3635 3640 3645 

Leu Leu Leu Glu Leu Ser Trp Glu Ala lie Glu Arg Ala Gly lie Asp 
3650 3655 3660 

Pro Gly ser Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu Met 
665 3670 ~ 3675 3680 

Tyr His Asp Tyr Gly Ala Arg Phe Ala Ser Arg Ala Pro Glu Gly Phe 
3685 3690 3695 

Glu Gly Tyr Leu Gly Asn Gly Ser Ala Gly Ser val Ala Ser Gly Arg 
3700 3705 3710 

He Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val Thr val Asp Thr 
3715 3720 3725 

Ala cys Ser ser Ser Leu Val Ala Leu His Leu Ala Gly Gin Ser Leu 
3730 3735 3740 

Arg Ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly val Thr val Met 
745 3750 3755 3760 

Ser Thr Pro Gly Thr Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala 
3765 3770 " 3775 

Pro Asp Gly Arg Cys Lys Ser Phe Ala Glu Ser Ala Asp Gly Thr Gly 
3780 3785 3790 

Trp Gly Glu Gly Ala Gly Leu val Leu Leu Glu Arg Leu Ser Asp Ala 
3795 3800 3805 

Arg Arg Asn Gly His Arg val Leu Ala Val Val Arg Gly Ser Ala val 
3810 3815 3820 

Asn Gin Asp Gly Ala ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser 
825 3830 3835 3840 

Gin Gin Arg val lie Gin Gin Ala Leu Ala Ser Ala Gly Leu Ser Val 
3845 3850 3855 

Ser Asp Val Asp Ala val Glu Ala His Gly Thr Gly Thr Arg Leu Gly 
3860 3865 3870 

Asp Pro lie Glu Ala Gin Ala Leu lie Ala Thr Tyr Gly Arg Asp Arg 
3875 3880 - 3885 

Asp Pro Gly Arg Pro Leu Trp Leu Gly Ser val Lys Ser Asn lie Gly 
3890 3895 3900 
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His Thr Gin Ala Ala Ala Gly val Ala Gly val lie Lys Met val Met 
905 3910 3915 3920 

Ala Met Arg His Gly Gin Leu Pro Arg Thr Leu His Val Asp Ala Pro 
3925 3930 3935 

ser ser Gin val Asp Trp Ser Ala Gly Arg val Gin Leu Leu Thr Glu 
3940 3945 3950 

Asn Thr Pro Trp Pro Asp Ser Gly Arg Pro Cys Arg val Gly val Ser 
3955 3960 3965 

Ser Phe Gly He Ser Gly Thr Asn Ala His val lie Leu Glu Gin Ser 
3970 3975 3980 

Thr Gly Gin Met Asp Gin Ala Ala Glu Pro Asp ser ser Pro val Leu 
985 3990 3995 4000 

Asp val pro Val Val Pro Trp val val Ser Gly Lys Thr Pro Glu Ala 
4005 4010 4015 

Leu Ser Ala Gin Ala Ala Thr Leu Ala Thr Tyr Leu Asp Gin Asn val 
4020 4025 4030 

Asp val Ser Pro Leu Asp val Gly lie Ser Leu Ala val Thr Arg Ser 
4035 4040 4045 

Ala Leu Asp Glu Arg Ala val Val Leu Gly Ser Asp Arg Asp Thr Leu 
4050 4055 4060 

Leu Ser Gly Leu Asn Ala Leu Ala Ala Gly His Glu Ala Ala Gly Val 
065 4070 4075 4080 

val Thr Gly Pro val Gly lie Gly Gly Arg Thr Gly Phe val Phe Ala 
4085 4090 4095 

Gly Gin Gly Gly Gin Trp Leu Gly Met Gly Arg Arg Leu Tyr Ser Glu 
4100 4105 "* 4110 

Phe Pro Ala Phe Ala Gly Ala Phe Asp Glu Ala Cys Ala Glu Leu Asp 
4115 4120 4125 

Ala Asn Leu Gly Arg Glu val Gly Val Arg Asp val val Phe Gly ser 
4130 ~ 4135 ~ 4140 

Asp Glu Ser Leu Leu Asp Arg Thr Leu Trp Ala Gin Ser Gly Leu Phe 
145 4150 ~ 4155 4160 

Ala Leu Gin val Gly Leu Trp Glu Leu Leu Gly Thr Trp Gly Val Arg 
4165 4170 4175 

Pro Ser Val Val Leu Gly His Ser Val Gly Glu Leu Ala Ala Ala Phe 
4180 4185 4190 

Ala Ala Gly Val Leu Ser Met Ala Glu Ala Ala Arg Leu Val Ala Gly 
4195 4200 4205 

Arg Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Ala Met Leu Ala 
4210 " 4215 4220 

val Ser Ala Thr Glu Ala Arg val Gly Pro Leu Leu Asp Gly Val Arg 
225 4230 ~ ' 4235 4240 
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Asp Arg val Gly val Ala Ala val Asn Ala Pro Gly ser val val Leu 
4245 4250 4255 

Ser Gly Asp Arg Asp val Leu Asp Gly lie Ala Gly Arg Leu Asp Gly 
4260 4265 4270 

Gin Gly He Arg Ser Arg Trp Leu Arg Val Ser His Ala Phe His Ser 
4275 * 4280 4285 

His Arg Met Asp Pro Met Leu Ala Glu Phe Ala Glu Leu Ala Arg Ser 
4290 4295 4300 

val Asp Tyr Arg ser Pro Arg Leu Pro lie val Ser Thr Leu Thr Gly 
305 4310 " 4315 4320 

Asn Leu Asp Asp val Gly val Met Ala Thr Pro Glu Tyr Trp val Arg 
4325 4330 4335 

Gin val Arg Glu Pro val Arg Phe Ala Asp Gly val Gin Ala Leu val 
4340 4345 4350 

Asp Gin Gly val Asp Thr lie val Glu Leu Gly Pro Asp Gly Ala Leu 
4355 4360 4365 

Ser ser Leu val Gin Glu Cys val Ala Glu Ser Gly Arg Ala Thr Gly 
4370 4375 4380 

lie Pro Leu val Arg Arg Asp Arg Asp Glu val Arg Thr val Leu Asp 
385 4390 " 4395 ~ 4400 

Ala Leu Ala Gin Thr His Thr Arg Gly Gly Ala val Asp Trp Gly Ser 
4405 4410 4415 

Phe Phe Ala Gly Thr Arg Ala Thr Gin val Asp Leu Pro Thr Tyr Ala 
4420 " 4425 4430 

Phe Gin Arg Gin Arg Tyr Trp Leu Glu Pro Ser Asp Ser Gly Asp Val 
4435 4440 4445 

Thr Gly val Gly Leu Thr Gly Ala Glu His Pro Leu Leu Gly Ala Val 
4450 4455 4460 

val Pro val Ala Gly Gly Asp Glu val Leu Leu Thr Gly Arg Leu Ser 
465 4470 4475 4480 

val Gly Thr His Pro Trp Leu Ala Glu His Arg Val Leu Gly Glu Val 
4485 4490 4495 

Val val Pro Gly Thr Ala Leu Leu Glu Met Ala Trp Arg Ala Gly Ser 
4500 4505 4510 

Gin Val Gly Cys Glu Arg val Glu Glu Leu Thr Leu Glu Ala Pro Leu 
4515 4520 4525 

Val Leu Pro Glu Arg Gly Ala Ala Ala val Gin Leu Ala val Gly Ala 
4530 ~ 4535 4540 

Pro Asp Glu Ala Gly Arg Arg Ser Leu Gin Leu Tyr ser Arg Gly Ala 
545 4550 4555 " 4560 

Asp Glu Asp Gly Asp Trp Arg Arg lie Ala Ser Gly Leu Leu Ala Gin 
4565 " 4570 4575 
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Ala Asn Ala val Pro Pro Ala Asp Ser Thr Ala Trp Pro Pro Asp Gly 
4580 4585 4590 

Ala Gly Gin val Asp Leu Ala Glu Phe Tyr Glu Arg Leu Ala Glu Arg 
4595 4600 4605 

Gly Leu Thr Tyr Gly Pro val Phe Gin Gly Leu Arg Ala Ala Trp Arg 
4610 4615 4620 

His Gly Asp Asp lie Phe Ala Glu Leu Ala Gly Ser Pro Asp Ala Ser 
625 4630 4635 4640 

Gly Phe Gly lie His Pro Ala Leu Leu Asp Ala Ala Leu His Ala Met 
4645 4650 4655 

Ala Leu Gly Ala Ser Pro Asp Ser Glu Ala Arg Leu Pro Phe Ser Trp 
4660 4665 4670 

Arg Gly Ala Gin Leu Tyr Arg Ala Glu Gly Ala Ala Leu Arg val Arg 
4675 4680 4685 

Leu Ser Pro Leu Gly Ser Gly Ala val Ser Leu Thr Leu Val Asp Ala 
4690 4695 4700 

Thr Gly Arg Arg val Ala Ala Val Glu Ser Leu Ser Thr Arg Pro val 
705 4710 4715 ~ 4720 

Ser Thr Asp Gin lie Gly Ala Gly Arg Gly Asp Gin Glu Arg Leu Leu 
4725 4730 4735 

His Val Glu Trp Val Arg Ser Ala Glu Ser Ala Gly Met Ser Leu Thr 
4740 4745 4750 

Ser cys Ala val val Gly Leu Gly Glu Pro Glu Trp His Ala Ala Leu 
4755 4760 4765 

Lys Thr Thr Gly val Gin val Glu Ser His Ala Asp Leu Ala Ser Leu 
4770 4775 4780 

Ala Thr Glu Val Ala Lys Arg Gly ser Ala Pro Gly Ala Val lie val 
785 4790 4795 4800 

Pro Cys Pro Arg Pro Arg Ala Met Gin Glu Leu Pro Thr Ala Ala Arg 
4805 4810 4815 

Arg Ala Thr Gin Gin Ala Met Ala Met Leu Gin Gin Trp Leu Ala Asp 
4820 4825 4830 

Asp Arg Phe val ser Thr Arg Leu lie Leu Leu Thr His Arg Ala val 
4835 4840 4845 

Ser Ala val Ala Gly Glu Asp Val Leu Asp Leu Val His Ala Pro Leu 
4850 4855 4860 

Trp Gly Leu val Arg Ser Ala Gin Ala Glu His Pro Asp Arg Phe Ala 
865 4870 4875 "* 4880 

Leu lie Asp Met Asp Asp Glu Arg Ala Ser Gin Thr Ala Leu Ala Glu 
4885 ~ 4890 4895 

Ala Leu Thr Ala Gly Glu Ala Gin Leu Ala Val Arg Ser Gly Val Val 
4900 4905 ~ 4910 
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Leu Ala Pro Arg Leu Gly Gin val Lys val Ser Gly Gly Glu Ala Phe 
4915 ~ 4920 4925 

Arg Trp Asp Glu Gly Thr val Leu val Thr Gly Gly Thr Gly Gly Leu 
4930 4935 4940 

Gly Ala Leu Leu Ala Arg His Leu Val Ser Ala His Gly Val Arg His 
945 4950 4955 4960 

Leu Leu Leu Ala Ser Arg Arg Gly Leu Ala Ala Pro Gly Ala Asp Glu 
4965 ~ 4970 4975 

Leu Val Ala Glu Leu Glu Gin Ala Gly Ala Asp val Ala val val Ala 
4980 4985 4990 

Cys Asp ser Ala Asp Arg Asp Ser Leu Ala Arg Leu val Ala Ser val 
4995 5000 5005 

Pro Ala Glu Asn Pro Leu Arg val val Val His Ala Ala Gly val Leu 
5010 5015 5020 

Asp Asp Gly val Leu Met Ser Met Ser Pro Glu Arg Leu Asp Ala Val 
025 5030 5035 5040 

Leu Arg Pro Lys Val Asp Ala Ala Trp Tyr Leu His Glu Leu Thr Arg 
5045 5050 5055 

Glu Leu Gly Leu ser Ala Phe val Leu Phe Ser ser val Ala Gly Leu 
5060 5065 5070 

Phe Gly Gly Ala Gly Gin Ser Asn Tyr Ala Ala Gly Asn Ala Phe Leu 
5075 5080 5085 

Asp Ala Leu Ala His Cys Arg Gin Ala Gin Gly Leu Pro Ala Leu Ser 
5090 5095 5100 

Leu Ala Ser Gly Leu Trp Ala Ser lie Asp Gly Met Ala Gly Asp Leu 
105 5110 5115 5120 

Ala Ala Ala Asp val Glu Arg Leu ser Arg Ala Gly lie Gly Pro Leu 
5125 5130 5135 

Ser Ala Pro Gly Gly Leu Ala Leu Phe Asp Ala Ala val Gly Ser Asp 
5140 5145 5150 

Glu Pro Leu Leu Ala Pro Val Arg Leu Asp val Glu Ala Leu Arg Val 
5155 5160 5165 

Gin Ala Arg Ser val Gin Thr Arg lie Pro Glu Met Leu His Gly Met 
5170 5175 " 5180 

Ala Met Gly Pro Ser Arg Arg Thr Pro Phe Thr ser Arg val Glu Pro 
185 5190 5195 5200 

Leu His Glu Arg Leu Ala Gly Leu Ser Glu Gly Glu Arg Arg Gin Gin 
5205 5210 5215 

val Leu Gin Arg val Arg Ala Asp lie Ala Val Val Leu Gly His Gly 
5220 5225 5230 

Arg Ser Ser Asp val Asp lie Glu Lys Pro Leu Ala Glu Leu Gly Phe 
5235 5240 5245 
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Asp Ser Leu Thr Ala lie Glu Leu Arg Asn Arg Leu Ala Thr Ala Thr 
5250 5255 ~ 5260 

Gly Leu Arg Leu Pro Ala Thr Leu Ala Phe Asp His Gly Thr Ala Ala 
265 ~ 5270 5275 5280 

Ala Leu Ala Gin His val Cys Ala Gin Leu Gly Thr Ala Thr Ala Pro 
5285 5290 5295 

Ala Pro Arg Arg Thr Asp Asp Asn Asp Ala Thr Glu Pro val Arg Ser 
5300 5305 5310 

Leu Phe Gin Gin Ala Tyr Ala Ala Gly Arg lie Leu Asp Gly Met Asp 
5315 5320 5325 

Leu Val Lys Val Ala Ala Gin Leu Arg Pro Val Phe Gly Ser Pro Gly 
5330 5335 5340 

Glu Leu Glu Ser Leu Pro Lys Pro Val Gin Leu Ser Arg Gly Pro Glu 
345 5350 5355 " 5360 

Glu Leu Ala Leu val Cys Met Pro Ala Leu lie Gly Met Pro Pro Ala 
5365 5370 5375 

Gin Gin Tyr Ala Arg lie Ala Ala Gly Phe Arg Asp val Arg Asp Val 
5380 5385 ~ 5390 

Ser val lie Pro Met Pro Gly Phe lie Ala Gly Glu Pro Leu Pro Ser 
5395 5400 5405 

Ala lie Glu val Ala val Arg Thr Gin Ala Glu Ala val Leu Gin Glu 
5410 5415 5420 

Phe Ala Gly Gly ser Phe Val Leu Val Gly His Ser Ser Gly Gly Trp 
425 5430 5435 5440 

Leu Ala His Glu Val Ala Gly Glu Leu Glu Arg Arg Gly Val Val Pro 
5445 5450 ' 5455 

Ala Gly val Val Leu Leu Asp Thr Tyr lie Pro Gly Glu lie Thr pro 
5460 5465 5470 

Arg Phe Ser Val Ala Met Ala His Arg Thr Tyr Glu Lys Leu Ala Thr 
5475 5480 ~ 5485 

Phe Thr Asp Met Gin Asp Val Gly lie Thr Ala Met Gly Gly Tyr Phe 
5490 5495 5500 

Arg Met Phe Thr Glu Trp Thr Pro Thr Pro lie Gly Ala Pro Thr Leu 
505 5510 5515 5520 

Phe val Arg Thr Glu Asp Cys Val Ala Asp Pro Glu Gly Arg Pro Trp 
5525 5530 ' 5535 

Thr Asp Asp Ser Trp Arg Pro Gly Trp Thr Leu Ala Asp Ala Thr val 
5540 " 5545 5550 

Gin val Pro Gly Asp His Phe Ser Met Met Asp Glu His Ala Gly Ser 
5555 5560 5565 

Thr Ala Gin Ala val Ala Ser Trp Leu Asp Lys Leu Asn Gin Arg Thr 
5570 5575 5580 

Page 245 



LeA33955-US 



Ala Arg Gin Arg 
585 



<210> 51 
<211> 2363 
<212> DNA 

<213> Saccharopolyspora spinosa 
<400> 51 

cccgggccgg ccgtgggcgc cgggtccgcg gggcggaacg ggtctgccgg gctgaccctg 60 
cgggccgcct ggccacctcg gccggtcctc cacgaatcct ccttggcctg ccggcgggcg 120 
ttacagtttc gggaagtgat tttgctcgtg ttccgaatgc aggctagcgg tgttcctggg 180 
gcggttgggc aggtcccagc aacagtggtg atatccctca taagggcgaa gcgacttcgt 240 
cacgttgcgt aatgcgggat cctgcttcgt agctcggtgt gtcatgccag actgcgcacg 300 
cggacctgca gcgggccgcg aaatcccggc gaggaagggc gcgatgcgga ttctggtcac 360 
cggcggagcc ggtttcatcg gctcgcacta cgttcggcag ttgctcggtg gtgcgtaccc 420 
cgcattcgcc gacgccgacg tggtcgtgct cgacaagctc acctacgccg gcaacgaggc 480 
gaacctggcg ccggtcgcgg acaacccccg gctgaagttc gtctgcggcg acatctgcga 540 
ccgcgaactg gttggcggcc tgatgtccgg cgtggacgtg gtggtgcact tcgccgccga 600 
aacccacgtc gaccgctcga tcaccggctc ggacgccttc gtgatcacca acgtggtcgg 660 
caccaacgtg ctgctgcagg ccgcgctcga cgccgagatc ggcaagttcg tgcacgtttc 720 
caccgacgag gtctacggct ccatcgagga cggctcgtgg cccgaagacc acgcgctgga 780 
gccgaattcc ccgtactcgg cggcgaaagc gggctcggac ctgctggccc gcgcctacca 840 
ccgcacccac ggactgccgg tgtgcatcac ccgctgctcc aacaactacg ggccctacca 900 
gttcccggag aaggtgctgc cgctgttcat cacgaacctg atggacggca gccaggtgcc 960 
gctctacggc gacgggctca acgtgcggga ctggctgcac gtcagcgacc actgccgggg 1020 
catccagctg gtggccgact ccgggcgcgc gggcgagatc tacaacatcg gcggcggcac 1080 
cgagctgacc aacaacgagc tgaccgagcg gctgctggca gagctgggcc tcgactggtc 1140 
ggtggtgcgg ccggtcaccg accgcaaggg ccacgaccgc cgctactcgg tggaccacag 1200 
caagatcgtc gaggaactgg ggtacgcgcc gcaggtcgac ttcgagaccg ggctgcgcga 1260 
gacaatccgc tggtaccagg acaaccggga ctggtgggag ccgctgaagg cccgatcggc 1320 
ggtggctcga tgagtcgcct cgccgtgctg gtgcccggcg gccgcggcca gctgggctcg 1380 
gagctggccc ggatcctccc cgcgcggacg ggggcgctgg tgcaccggcc gggttccggg 1440 
gaactggacg tcaccgacgc cgaggaggtc gccgacgcgt tgggttcctt cgcggagacg 1500 
gcgaaggacg cggagctgcg accggtggtg atcaacgccg cggcgtacac ggcggtggac 1560 
gcggccgagt ccgacccgga ccgcgcggcc cggatcaacg ccgaaggcgc ggcctcgctg 1620 
gcgaaagcgt gccggagcag cggtctgccc ctggtgcacg tgtcgacgga ttacgtgttc 1680 
cccggtgatg gggcccggcc gtacgagccg acggacccga ccgggccgcg atcggtctac 1740 
gggcgcacca agctcgaagg cgaacgggcc gtgctggagt ccggcgcgcg ggcctgggtg 1800 
gtgcgcacgg catgggtgta cggcgcgagc ggcaagaact tcctgaaaac gatgatccgc 1860 
ctctcggggg agcgcgacac gctgtccgtt gtggacgatc agatcggctc gccgacttgg 1920 
gcggcggacc tggcgagcgg cctgctggag ctggccgaac gggtcgccga acgccgtgga 1980 
ccggagcaga aggtgctgca ctgcaccaat tccggccagg tgacctggta cgagttcgcg 2040 
cgggcgatct tcgcggaatt cggcctggac gagaaccgcg tccacccgtg cacgacggcg 2100 
gacttccccc tcccggcgca ccgcccggcc tactcggtcc tgtccgacgt ggcgtggcga 2160 
gaggcgggcc tgaccccgat gcgcacctgg cgggaagccc tggcggcggc cttcgagaaa 2220 
gacggcgaaa ccctccgaac ccgctgacca gtcacccgga gggcgcgagt agccccggca 2280 
gggccgcttc gacgcgatat cggctggcgc ggtgcgcaca atgggtgtcg ccggggcgag 2340 
gaaggaaggc caggtgcccc ggg 2363 



<210> 52 
<211> 990 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(987) 

<223> ORF23; dNDP-glucose-4,6-dehydratase 
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<400> 52 

atg egg att ctg gtc acc gqc gqa gec gqt ttc ate gqc teg cac tac 
Met Arg lie Leu Val Thr Gly Gly Ala Gly Phe lie Gly Ser His Tyr 
1 5 10 15 



gtq cac ttc gec gee gaa acc cac gtc gac cgc teg ate acc gqc teg 
val His Phe Ala Ala Glu Thr His val Asp Arg Ser lie Thr Gly Ser 
85 90 " 95 



ctg gee cgc gec tac cac cgc acc cac gqa ctg ccg gtq tgc ate acc 
Leu Ala Arg Ala Tyr His Arg Thr His Gly Leu Pro val Cys lie Thr 
165 170 175 



48 



gtt egg cag ttg etc gqt gqt gcg tac ccc gca ttc gee gac gee gac 96 
val Arg Gin Leu Leu Gly Gly Ala Tyr Pro Ala Phe Ala Asp Ala Asp 
20 25 30 

gtq gtc gtq etc gac aag etc acc tac gee gqc aac gag gcg aac ctg 144 
Val Val Val Leu Asp Lys Leu Thr Tyr Ala Gly Asn Glu Ala Asn Leu 
35 40 45 

gcg ccg gtc gcg gac aac ccc egg ctg aag ttc gtc tgc gqc gac ate 192 
Ala Pro val Ala Asp Asn Pro Arg Leu Lys Phe val Cys Gly Asp lie 
50 55 " 60 

tgc gac cgc gaa ctg gtt gqc gqc ctg atg tec gqc gtq gac gtq gtq 240 
Cys Asp Arg Glu Leu val Gly Gly Leu Met Ser Gly val Asp val val 
65 70 75 80 



288 



gac gec ttc gtq ate acc aac gtq gtc gqc acc aac gtq ctg ctg cag 336 

Asp Ala Phe val lie Thr Asn val val Gly Thr Asn val Leu Leu Gin 

100 105 110 

gee gcg etc gac gec gag ate gqc aag ttc gtq cac gtt tec acc gac 384 

Ala Ala Leu Asp Ala Glu lie Gly Lys Phe val His val Ser Thr Asp 

115 120 125 

gag gtc tac gqc tec ate gag gac gqc teg tgg ccc gaa gac cac gcg 432 

Glu val Tyr Gly Ser lie Glu Asp Gly Ser Trp Pro Glu Asp His Ala 
130 135 140 

ctg gag ccg aat tec ccg tac teg gcg gcg aaa gcg gqc teg gac ctg 480 

Leu Glu Pro Asn Ser Pro Tyr ser Ala Ala Lys Ala Gly Ser Asp Leu 
145 150 155 160 



528 



672 



cgc tgc tec aac aac tac gqg ccc tac cag ttc ccg gag aag gtq ctg 576 

Arg Cys Ser Asn Asn Tyr Gly Pro Tyr Gin Phe Pro Glu Lys val Leu 

180 185 190 

ccg ctg ttc ate acg aac ctg atg gac gqc age cag gtq ccg etc tac 624 

Pro Leu Phe lie Thr Asn Leu Met Asp Gly Ser Gin Val Pro Leu Tyr 

195 200 205 

gqc gac gqg etc aac gtq egg gac tgg ctg cac gtc age gac cac tgc 

Gly Asp Gly Leu Asn Val Arg Asp Trp Leu His Val ser Asp His Cys 
210 215 220 

egg gqc ate cag ctg gtq gee gac tec gqg cgc gcg gqc gag ate tac 720 

Arg Gly lie Gin Leu Val Ala Asp Ser Gly Arg Ala Gly Glu lie Tyr 
225 230 235 240 

aac ate ggc ggc ggc acc gag ctg acc aac aac gag ctg acc gag egg 768 
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Asn lie Gly Gly Gly Thr Glu Leu Thr Asn Asn Glu Leu Thr Glu Arg 
245 250 255 

ctg ctg gca gag ctg ggc etc gac tgg teg gtg gtg egg ccg gtc acc 816 
Leu Leu Ala Glu Leu Gly Leu Asp Trp Ser vat vai Arg Pro val Thr 
260 265 270 

gac cgc aag ggc cac gac cgc cgc tac teg gtq gac cac age aag ate 864 
Asp Arg Lys Gly His Asp Arg Arg Tyr Ser Val Asp His Ser Lys lie 
275 280 285 

gtc gag gaa ctg gqg tac gcg ccg cag gtc gac ttc gag acc ggg ctg 912 
val Glu Glu Leu Gly Tyr Ala Pro Gin Val Asp Phe Glu Thr Gly Leu 
290 295 300 

cgc gag aca ate cgc tgg tac cag gac aac egg gac tgg tgg gag ccg 960 
Arg Glu Thr lie Arg Trp Tyr Gin Asp Asn Arg Asp Trp Trp Glu Pro 
305 310 315 320 

ctg aag gec cga teg gcg gtg get cga tga 990 
Leu Lys Ala Arg Ser Ala val Ala Arg 
325 

<210> 53 
<211> 329 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 53 

Met Arg lie Leu Val Thr Gly Gly Ala Gly Phe lie Gly Ser His Tyr 
1 5 10 15 

val Arg Gin Leu Leu Gly Gly Ala Tyr Pro Ala Phe Ala Asp Ala Asp 
20 25 30 

Val Val Val Leu Asp Lys Leu Thr Tyr Ala Gly Asn Glu Ala Asn Leu 
35 40 45 

Ala Pro Val Ala Asp Asn Pro Arg Leu Lys Phe Val Cys Gly Asp lie 
50 55 60 

Cys Asp Arg Glu Leu val Gly Gly Leu Met Ser Gly Val Asp Val Val 
65 ~ 70 75 80 

val His Phe Ala Ala Glu Thr His val Asp Arg Ser He Thr Gly Ser 
85 90 95 

Asp Ala Phe val lie Thr Asn val val Gly Thr Asn Val Leu Leu Gin 
100 105 ' 110 

Ala Ala Leu Asp Ala Glu lie Gly Lys Phe val His val Ser Thr Asp 
115 120 125 

Glu Val Tyr Gly Ser lie Glu Asp Gly Ser Trp Pro Glu Asp His Ala 
130 135 140 

Leu Glu Pro Asn ser Pro Tyr ser Ala Ala Lys Ala Gly Ser Asp Leu 
145 150 155 160 

Leu Ala Arg Ala Tyr His Arg Thr His Gly Leu Pro Val Cys lie Thr 
165 " 170 175 
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Arg Cys Ser Asn Asn Tyr Gly Pro Tyr Gin Phe Pro Glu Lys Val Leu 
180 185 190 

Pro Leu Phe lie Thr Asn Leu Met Asp Gly Ser Gin val Pro Leu Tyr 
195 200 205 

Gly Asp Gly Leu Asn val Arg Asp Trp Leu His val Ser Asp His Cys 
210 215 220 

Arg Gly lie Gin Leu val Ala Asp Ser Gly Arg Ala Gly Glu lie Tyr 
225 230 235 240 

Asn lie Gly Gly Gly Thr Glu Leu Thr Asn Asn Glu Leu Thr Glu Arg 
245 250 255 

Leu Leu Ala Glu Leu Gly Leu Asp Trp Ser val Val Arg Pro Val Thr 
260 265 270 

Asp Arg Lys Gly His Asp Arg Arg Tyr Ser Val Asp His Ser Lys lie 
275 280 285 

Val Glu Glu Leu Gly Tyr Ala Pro Gin Val Asp Phe Glu Thr Gly Leu 
290 295 300 

Arg Glu Thr lie Arg Trp Tyr Gin Asp Asn Arg Asp Trp Trp Glu Pro 
305 310 315 320 

Leu Lys Ala Arg Ser Ala Val Ala Arg 
325 



<210> 54 
<211> 918 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> CD--C915) 

<223> ORF24; dNDP-4-keto-6-deoxygl ucose-3 , 5-epimerase 
<400> 54 

atg agt cgc etc gec gtq ctg gtq ccc gqc gqc cgc gqc cag ctg gqc 48 
Met Ser Arg Leu Ala Val Leu val Pro Gly Gly Arg Gly Gin Leu Gly 
15 10 15 

teg gag ctg gec egg ate etc ccc gcg egg acg gqg gcg ctg gtq cac 96 
Ser Glu Leu Ala Arg lie Leu Pro Ala Arg Thr Gly Ala Leu val His 
20 25 30 

egg ccg gqt tec gqg gaa ctg gac gtc ace gac gec gag gag gtc gee 144 
Arg Pro Gly Ser Gly Glu Leu Asp Val Thr Asp Ala Glu Glu val Ala 
35 40 45 

gac gcg ttg gqt tec ttc gcg gag acg gcg aag gac gcg gag ctg cga 192 
Asp Ala Leu Gly Ser Phe Ala Glu Thr Ala Lys Asp Ala Glu Leu Arg 
50 55 60 

ccg gtq gtq ate aac gec gcg gcg tac acg gcg gtq gac gcg gec gag 240 
pro val val lie Asn Ala Ala Ala Tyr Thr Ala val Asp Ala Ala Glu 
65 70 75 80 
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tec gac ccg gac cgc gcg gec egg ate aac gee gaa ggc gcg gec teg 288 

Ser Asp Pro Asp Arg Ala Ala Arg lie Asn Ala Glu Gly Ala Ala Ser 

85 90 95 

ctg gcg aaa gcg tgc egg age age ggt ctg ccc ctg gtg cac gtg teg 336 

Leu Ala Lys Ala Cys Arg Ser Ser Gly Leu Pro Leu Vat His val Ser 

100 ~ 105 110 

acg gat tac gtq ttc ccc gqt gat ggg gec egg ccg tac gag ccg acg 384 

Thr Asp Tyr val Phe Pro Gly Asp Gly Ala Arg Pro Tyr Glu Pro Thr 
115 120 125 

gac ccg acc gqg ccg cga teg gtc tac gqg cgc acc aag etc gaa ggc 432 

Asp Pro Thr Gly Pro Arg ser Val Tyr Gly Arg Thr Lys Leu Glu Gly 
130 135 140 



gaa egg gec gtq ctg gag tec ggc gcg egg gec tgg gtg gtg cgc acg 
Glu Arg Ala val Leu Glu Ser Gly Ala Arg Ala Trp Val val Arg Thr 
145 150 155 160 



480 



gca tgg gtg tac ggc gcg age ggc aag aac ttc ctg aaa acg atg ate 528 
Ala Trp val Tyr Gly Ala Ser Gly Lys Asn Phe Leu Lys Thr Met lie 
165 170 175 

cgc etc teg ggg gag cgc gac acg ctg tec gtt gtg gac gat cag ate 576 
Arg Leu ser Gly Glu Arg Asp Thr Leu Ser val val Asp Asp Gin lie 
180 185 190 

ggc teg ccg act tgg gcg gcg gac ctg gcg age ggc ctg ctg gag ctg 624 
Gly ser Pro Thr Trp Ala Ala Asp Leu Ala Ser Gly Leu Leu Glu Leu 
195 200 205 

gec gaa egg gtc gee gaa cgc cgt gga ccg gag cag aag gtg ctg cac 672 
Ala Glu Arg val Ala Glu Arg Arg Gly Pro Glu Gin Lys Val Leu His 
210 215 220 

tgc acc aat tec ggc cag gtg acc tgg tac gag ttc gcg egg gcg ate 720 
Cys Thr Asn ser Gly Gin Val Thr Trp Tyr Glu Phe Ala Arg Ala lie 
225 230 235 240 

ttc gcg gaa ttc ggc ctg gac gag aac cgc gtc cac ccg tgc acg acg 768 
Phe Ala Glu Phe Gly Leu Asp Glu Asn Arg Val His Pro Cys Thr Thr 
245 250 255 

gcg gac ttc ccc etc ccg gcg cac cgc ccg gee tac teg gtc ctg tec 816 
Ala Asp Phe Pro Leu Pro Ala His Arg Pro Ala Tyr ser Val Leu ser 
260 265 270 

gac gtg gcg tgg cga gag gcg ggc ctg acc ccg atg cgc acc tgg egg 864 
Asp Val Ala Trp Arg Glu Ala Gly Leu Thr Pro Met Arg Thr Trp Arg 
275 280 285 

gaa gec ctg gcg gcg gec ttc gag aaa gac ggc gaa acc etc cga acc 912 
Glu Ala Leu Ala Ala Ala Phe Glu Lys Asp Gly Glu Thr Leu Arg Thr 
290 295 300 

cgc tga 918 

Arg 

305 

<210> 55 
<211> 305 
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<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 55 

Met Ser Arg Leu Ala Val Leu val Pro Gly Gly Arg Gly Gin Leu Gly 
15 10 15 

Ser Glu Leu Ala Arg lie Leu Pro Ala Arg Thr Gly Ala Leu val His 
20 25 ~ 30 

Arg Pro Gly Ser Gly Glu Leu Asp Val Thr Asp Ala Glu Glu val Ala 
35 40 45 

Asp Ala Leu Gly Ser Phe Ala Glu Thr Ala Lys Asp Ala Glu Leu Arg 
50 55 60 

Pro val val lie Asn Ala Ala Ala Tyr Thr Ala Val Asp Ala Ala Glu 
65 70 75 80 

Ser Asp Pro Asp Arg Ala Ala Arg lie Asn Ala Glu Gly Ala Ala Ser 
85 - 90 95 

Leu Ala Lys Ala Cys Arg Ser Ser Gly Leu Pro Leu Val His Val Ser 
100 105 110 

Thr Asp Tyr val Phe Pro Gly Asp Gly Ala Arg Pro Tyr Glu Pro Thr 
115 120 ' 125 

Asp Pro Thr Gly Pro Arg Ser Val Tyr Gly Arg Thr Lys Leu Glu Gly 
130 135 140 

Glu Arg Ala val Leu Glu Ser Gly Ala Arg Ala Trp val Val Arg Thr 
145 150 155 160 

Ala Trp val Tyr Gly Ala Ser Gly Lys Asn Phe Leu Lys Thr Met lie 
165 170 175 

Arg Leu Ser Gly Glu Arg Asp Thr Leu Ser Val val Asp Asp Gin lie 
180 ~ 185 190 

Gly Ser Pro Thr Trp Ala Ala Asp Leu Ala Ser Gly Leu Leu Glu Leu 
195 200 205 

Ala Glu Arg Val Ala Glu Arg Arg Gly Pro Glu Gin Lys val Leu His 
210 ~ 215 220 

Cys Thr Asn Ser Gly Gin Val Thr Trp Tyr Glu Phe Ala Arg Ala lie 
225 230 235 240 

Phe Ala Glu Phe Gly Leu Asp Glu Asn Arg val His Pro Cys Thr Thr 
245 250 255 

Ala Asp Phe Pro Leu Pro Ala His Arg Pro Ala Tyr ser Val Leu Ser 
260 265 270 

Asp val Ala Trp Arg Glu Ala Gly Leu Thr Pro Met Arg Thr Trp Arg 
275 280 285 

Glu Ala Leu Ala Ala Ala Phe Glu Lys Asp Gly Glu Thr Leu Arg Thr 
290 295 300 

Arg 
305 
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